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CORONAVIRUS ISOLATED FROM HUMANS 


PRIORITY CLAIM 


This is a division of co-pending U.S. patent application Ser. 
No. 10/822,904, filed Apr. 12, 2004, and issued as U.S. Pat. 
No. 7,220,852 on Mav 22, 2007, which in turn claims the 
benefit of U.S. Provisional Patent Application No. 60/465, 
927 filed Apr. 25, 2003. Both applications are incorporated 
herein by reference in their entirety. 


STATEMENT OF GOVERNMENT SUPPORT 


This invention was made by the Centers for Disease Con- 
trol and Prevention, an agency of the United States Govern- 
ment. Therefore, the U.S. Government has certain rights in 
this invention. 


FIELD OF THE DISCLOSURE 


This invention relates to a newlv isolated human coronavi- 
rus. More particularly, it relates to an isolated coronavirus 
genome, isolated coronavirus proteins, and isolated nucleic 
acid molecules encoding the same. The disclosure further 
relates to methods of detecting a severe acute respiratory 
syndrome-associated coronavirus and compositions com- 
prising immunogenic coronavirus compounds. 


BACKGROUND 


The coronaviruses (order Nidovirales, family Coronaviri- 
dae, genus Coronavirus) are a diverse group of large, envel- 
oped, positive-stranded RNA viruses that cause respiratory 
and enteric diseases in humans and other animals. At approxi- 
mately 30,000 nucleotides (nt), their genome is the largest 
found in any of the RNA viruses. Coronaviruses are spherical, 
100-160 nm in diameter with 20-40 nm complex club shaped 
surface projections surrounding the periphery. Coronaviruses 
share common structural proteins including a spike protein 
(S), membrane protein (M), envelope protein (E), and, in a 
subset of coronaviruses, a hemagglutinin-esterase protein 
(HE). The S protein, a glycoprotein which protrudes from the 
virus membrane, is involved in host cell receptor binding and 
is a target for neutralizing antibodies. The E and M proteins 
are involved in virion formation and release from the host cell. 
Coronavirus particles are found within the cisternae of the 
rough endoplasmic reticulum and in vesicles of infected host 
cells where virions are assembled. The coronavirus genome 
consists of two open reading frames (ORF1a and ORF 1b) 
yielding an RNA polymerase and a nested set of subgenomic 
mRNAs encoding structural and nonstructural proteins, 
including the S, E, M, and nucleocapsid (N) proteins. The 
genus Coronavirus includes at least 13 species which have 
been subdivided into at least three groups (groups I, II, and 
III) on the basis of serological and genetic properties (deVries 
et al., Sem. Virol. 8:33-47, 1997; Fields et al. eds. Fields 
Virology, 3rd edition, Raven Press, Philadelphia, 1323-1341, 
1996; Mahey and Collier eds. Microbiology and Microbial 
Infections, Volume 1 Virology, 9” edition, Oxford University 
Press, 463-479, 1998). 

The three known groups of coronavirus are associated with 
a variety of diseases of humans and domestic animals (for 
example, cattle, pigs, cats, dogs, rodents, and birds), includ- 
ing gastroenteritis and upper and lower respiratory tract dis- 
ease. Known coronaviruses include human Coronavirus 
229E (HCoV-229B), canine coronavirus (CCoV), feline 
infectious peritonitis virus (FIPV), porcine transmissible gas- 
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troenteritis virus (TGEV), porcine epidemic diarrhea virus 
(PEDV), human coronavirus OC43 (HCoV-OC43), bovine 
coronavirus (BCoV), porcine hemagglutinating encephalo- 
myelitis virus (HEV), rat sialodacryoadenitis virus (SDAV), 
mouse hepatitis virus (MHV), turkey coronavirus (TCoV), 
and avian infectious bronchitis virus (IBV-Avian) (Fields et 
al. eds. Fields Virology, 3rd edition, Raven Press, Philadel- 
phia, 1323-1341, 1996; Mahey and Collier eds. Microbiology 
and Microbial Infections, Volume 1 Virology, 9” edition, 
Oxford University Press, 463-479, 1998). 


Coronavirus infections are generally host specific with 
respect to infectivity and clinical symptoms. Coronaviruses 
further exhibit marked tissue tropism; infection in the incor- 
rect host species or tissue type may result in an abortive 
infection, mutant virus production and altered virulence. 
Coronaviruses generally do not grow well in cell culture, and 
animal models for human coronavirus infection are lacking. 
Therefore, little is known about them (Fields et al. eds. Fields 
Virology, 3rd edition, Raven Press, Philadelphia, 1323-1341, 
1996). The known human coronaviruses are notably fastidi- 
ous in cell culture, preferring select cell lines, organ culture, 
or suckling mice for propagation. Coronaviruses grown in 
cell culture exhibit varying degrees of virulence and/or cyto- 
pathic effect (CPE) depending on the host cell type and cul- 
ture conditions. The only human or animal coronavirus which 
has been shown to grow in Vero E6 cells is PEDV, and it 
requires the addition of trypsin to culture medium for growth 
in Vero E6 cells. Moreover, PEDV adapted to Vero E6 cell 
culture results ina strikingly different CPE, with cytoplasmic 
vacuoles and the formation of large syncytia (Hofmann and 
Wyler, J. Clin. Micro. 26:2235-39, 1988; Kusanagi et el., J. 
Vet. Med. Sci. 554:313-18, 1991). 

Coronavirus have not previously been known to cause 
severe disease in humans, but have been identified as a major 
cause of upper respiratory tract illness, including the common 
cold. Repeat infections in humans are common within and 
across serotype, suggesting that immune response to coro- 
navirus infection in humans is either incomplete or short 
lived. Coronavirus infection in animals can cause severe 
enteric or respiratory disease. Vaccination has been used suc- 
cessfully to prevent and control some coronavirus infections 
in animals. The ability of animal-specific coronaviruses to 
cause severe disease raises the possibility that coronavirus 
could also cause more severe disease in humans (Fields et al. 
eds. Fields Virology, 3rd edition, Raven Press, Philadelphia, 
1323-1341, 1996; Mahey and Collier eds. Microbiology and 
Microbial Infections, Volume 1 Virology, 9” edition, Oxford 
University Press, 463-479, 1998). 


In late 2002, cases of life-threatening respiratory disease 
with no identifiable etiology were reported from Guangdong 
Province, China, followed by reports from Vietnam, Canada, 
and Hong Kong of severe febrile respiratory illness that 
spread to household members and health care workers. The 
syndrome was designated “severe acute respiratory syn- 
drome” (SARS) in February 2003 by the Centers for Disease 
Control and Prevention (MMWR, 52:241-48, 2003). 


Past efforts to develop rapid diagnostics and vaccines for 
coronavirus infection in humans have been hampered by a 
lack of appropriate research models and the moderate course 
of disease in humans. Therefore, a need for rapid diagnostic 
tests and vaccines exists. 


SUMMARY OF THE DISCLOSURE 


Anewly isolated human coronavirus has been identified as 
the causative agent of SARS, and is termed SARS-CoV. The 
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nucleic acid sequence of the SARS-CoV genome and the 
amino acid sequences ofthe SARS-CoV open reading frames 
are provided herein. 

This disclosure provides methods and compositions useful 
in detecting the presence of a SARS-CoV nucleic acid in a 
sample and/or diagnosing a SARS-CoV infection in a subject. 
Also provided are methods and compositions useful in detect- 
ing the presence of a SARS-CoV antigen or antibody in a 
sample and/or diagnosing a SARS-CoV infection in a subject. 

The foregoing and other features and advantages will 
become more apparent from the following detailed descrip- 
tion of several embodiments, which proceeds with reference 
to the accompanying figures. 


BRIEF DESCRIPTION OF THE FIGURES 


FIGS. 1A-B are photomicrographs illustrating typical 
early cytopathic effects seen with coronavirus isolates and 
serum from SARS patients. FIG. 1A is a photomicrograph of 
Vero E6 cells inoculated with an oropharyngeal specimen 
from a SARS patient (x40). FIG. 1B is a photomicrograph of 
infected Vero E6 cells reacting with the serum of a convales- 
cent SARS patient in an indirect fluorescent antibody (IFA) 
assay (x400). 

FIGS. 2A-B are electronmicrographs illustrating ultra- 
structural characteristics ofthe SARS-associated coronavirus 
(SARS-CoV). FIG. 2A is a thin-section electron-microscopi- 
cal view of viral nucleocapsids aligned along the membrane 
of the rough endoplasmic reticulum (arrow) as particles bud 
into the cisternae. Enveloped virions have surface projections 
(arrowhead) and an electron-lucent center. Directly under the 
viral envelope lies a characteristic ring formed by the helical 
nucleocapsid, often seen in cross-section. FIG. 2B is a nega- 
tive stain (methylamine tungstate) electronmicrograph show- 
ing stain-penetrated coronavirus particle with the typical 
internal helical nucleocapsid-like structure and club-shaped 
surface projections surrounding the periphery of the particle. 
Bars: 100 nm. 

FIG. 3 is an estimated maximum parsimony tree illustrat- 
ing putative phylogenetic relationships between SARS-CoV 
and other human and animal coronaviruses. Phylogenetic 
relationships are based on sequence alignment of 405 nucle- 
otides of the coronavirus polymerase gene ORF1b (nucleic 
acid 15,173 to 15,578 of SEQ ID NO: 1). The three major 
coronavirus antigenic groups (I, II and III), represented by 
HCoV-229E, CCoV, FIPV, TGEV, PEDV, HCoV-OC43, 
BCoV, HEV, SDAV, MHV, TCoV, and IBV-Avian, are shown 
shaded. Bootstrap values (100 replicates) obtained from a 
5096 majority rule consensus tree are plotted at the main 
internal branches of the phylogram. Branch lengths are pro- 
portionate to nucleotide differences. 

FIG. 4is a pictorial representation of neighbor joining trees 
illustrating putative phylogenetic relationships between 
SARS-CoV and other human and animal coronaviruses. 
Amino acid sequences of the indicated SARS-CoV proteins 
were compared with those from reference viruses represent- 
ing each species in the three groups of coronaviruses for 
which complete genomic sequence information was available 
[group 1: HCoV-229E (AF304460); PEDV (AF353511); 
TGEV (AJ271965); group 2: BCoV (AF220295); MHV 
(AF201929); group 3: infectious bronchitis virus (M95169)]. 
Sequences for representative strains of other coronavirus spe- 
cies, for which partial sequence information was available, 
were included for some of the structural protein comparisons 
[group 1: CCoV (D13096); FCoV (AY204704); porcine res- 
piratory coronavirus (Z24675); group 2: HCoV-OC43 
(M76373, L14643, M93390); HEV (AY078417); rat coro- 
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navirus (AF207551)]. Sequence alignments and neighbor 
joining trees were generated by using Clustalx 1.83 with the 
Gonnet protein comparison matrix. The resulting trees were 
adjusted for final output using treetool 2.0.1. 

FIGS. 5A-C are photomicrographs illustrating diffuse 
alveolar damage in a patient with SARS (FIGS. 5A-B), and 
immunohistochemical staining of SARS-CoV-infected Vero 
E6 cells (FIG. 5C). FIG. 5A is a photomicrograph of lung 
tissue from a SARS patient (x50). Diffuse alveolar damage, 
abundant foamy macrophages and multinucleated syncytial 
cells are present; hematoxylin and eosin stain was used. FIG. 
5B is a higher magnification photomicrograph of lung tissue 
from the same SARS patient (x250). Syncytial cells show no 
conspicuous viral inclusions. FIG. 5C is a photomicrograph 
ofimmunohistochemically stained SARS-CoV-infected cells 
(x250). Membranous and cytoplasmic immunostaining of 
individual and syncytial Vero E6 cells was achieved using 
feline anti-FIPV-1 ascitic fluid. Immunoalkaline phosphatase 
with naphthol-fast red substrate and hematoxylin counter 
stain was used. 

FIGS. 6A-B are electronmicrographs illustrating ultra- 
structural characteristics of a coronavirus-infected cell in 
bronchoalveolar lavage (BAL) from a SARS patient. FIG. 6A 
is an electron micrograph of a coronavirus-infected cell. 
Numerous intracellular and extracellular particles are 
present; virions are indicated by the arrowheads. FIG. 6B is a 
higher magnification electronmicrograph of the area seen at 
the arrow in FIG. 6A (rotated clockwise approximately) 90°. 
Bars: FIG. 6A, 1 um; FIG. 6B, 100 nm. 

FIGS. 7A-C illustrate the organization of the SARS-CoV 
genome. FIG. 7A is a diagram of the overall organization of 
the 29,727-nt SARS-CoV genomic RNA. The 72-nt leader 
sequence is represented as a small rectangle at the left-most 
end. ORFsla and 1b, encoding the nonstructural polypro- 
teins, and those ORFs encoding the S, E, M, and N structural 
proteins, are indicated. Vertical position of the boxes indi- 
cates the phase of the reading frame (phase 1 for proteins 
above the line, phase two for proteins on the line and phase 
three for proteins below the line). FIG. 7B is an expanded 
view of the structural protein encoding region and predicted 
mRNA transcripts. Known structural protein encoding 
regions (dark grey boxes) and regions and reading frames for 
potential products X1-X5 (light gray boxes) are indicated. 
Lengths and map locations of the 3'-coterminal mRNAs 
expressed by the SARS-CoV are indicated, as predicted by 
identification of conserved transcriptional regulatory 
sequences. FIG. 7C is a digitized image of a nylon membrane 
showing Northern blot analysis of SARS-CoV mRNAs. Poly 
(A)+ RNA from infected Vero E6 cells was separated on a 
formaldehyde-agarose gel, transferred to a nylon membrane, 
and hybridized with a digoxigenin-labeled riboprobe over- 
lapping the 3'-untranslated region. Signals were visualized by 
chemiluminescence. Sizes of the SARS-CoV mRNAs were 
calculated by extrapolation from a log-linear fit of the 
molecular mass marker. Lane 1, SARS-CoV mRNA; lane 2, 
Vero E6 cell mRNA; lane 3, molecular mass marker, sizes in 
kB. 


SEQUENCE LISTING 


The nucleic and amino acid seguences listed in the accom- 
panving seguence listing are shown using standard letter 
abbreviations for nucleotide bases, and three letter code for 
amino acids, as defined in 37 C.F.R. 1.822. Only one strand of 
each nucleic acid seguence is shown, but the complementarv 
strand is understood as included bv anv reference to the 
displaved strand. In the accompanving seguence listing: 
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SEQ ID NO: 1 shows the nucleic acid sequence of the 
SARS-CoV genome. 

SEQ ID NO: 2 shows the amino acid sequence of the 
SARS-CoV polyprotein 1a (encoded by nucleic acid 265 to 
nucleic acid 13,398 of SEQ ID NO: 1). 

SEQ ID NO: 3 shows the amino acid sequence of the 
SARS-CoV polyprotein 1b (encoded by nucleic acid 13,398 
to 21,482 of SEQ ID NO: 1). 

SEQ ID NO: 4 shows the amino acid sequence of the 
SARS-CoV S protein (encoded by nucleic acid 21,492 to 
25,256 of SEQ ID NO: 1). 

SEQ ID NO: 5 shows the amino acid sequence of the 
SARS-CoV X1 protein (encoded by nucleic acid 25,268 to 
26,089 of SEQ ID NO: 1). 

SEQ ID NO: 6 shows the amino acid sequence of the 
SARS-CoV X2 protein (encoded by nucleic acid 25,689 to 
26,150 of SEQ ID NO: 1). 

SEQ ID NO: 7 shows the amino acid sequence of the 
SARS-CoV E protein (encoded by nucleic acid 26,117 to 
26,344 of SEQ ID NO: 1). 

SEQ ID NO: 8 shows the amino acid sequence of the 
SARS-CoV M protein (encoded by nucleic acid 26,398 to 
27,060 of SEQ ID NO: 1). 

SEQ ID NO: 9 shows the amino acid sequence of the 
SARS-CoV X3 protein (encoded by nucleic acid 27,074 to 
27,262 of SEQ ID NO: 1). 

SEQ ID NO: 10 shows the amino acid sequence of the 
SARS-CoV X4 protein (encoded by nucleic acid 27,273 to 
27,638 of SEQ ID NO: 1). 

SEQ ID NO: 11 shows the amino acid sequence of the 
SARS-CoV X5 protein (encoded by nucleic acid 27,864 to 
28,115 of SEQ ID NO: 1). 

SEQ ID NO: 12 shows the amino acid sequence of the 
SARS-CoV N protein (encoded by nucleic acid 28,120 to 
29,385 of SEQ ID NO: 1). 

SEQ ID NOs: 13-15 show the nucleic acid sequence of 
several SARS-CoV-specific oligonucleotide primers. 

SEQ ID NOs: 16-33 show the nucleic acid sequence of 
several oligonucleotide primers/probes used for real-time 
reverse transcription-polymerase chain reaction (RT-PCR) 
SARS-CoV assays. 

SEQ ID NOs: 34-35 show the nucleic acid sequence of two 
degenerate primers designed to anneal to sites encoding con- 
served coronavirus amino acid motifs. 

SEQ ID NOs: 36-38 show the nucleic acid sequence of 
several oligonucleotide primers/probes used as controls in 
real-time RT-PCR assays. 











DETAILED DESCRIPTION OF SEVERAL 
EMBODIMENTS 


I. Abbreviations 


BAL: bronchoalveolar lavage 

CPE: cytopathic effect 

E: coronavirus transmembrane protein 

ELISA: enzyme-linked immunosorbent assay 
HE: coronavirus hemagglutinin-esterase protein 
IFA: indirect fluorescent antibody 

M: coronavirus membrane protein 

N: coronavirus nucleoprotein 

ORF: open reading frame 

PCR polymerase chain reaction 

RACE: 5' rapid amplification of cDNA ends 
RT-PCR: reverse transcription-polymerase chain reaction 
S: coronavirus spike protein 

SARS: severe acute respiratory syndrome 
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SARS-CoV: severe acute respiratorv syndrome-associated 
coronavirus 
TRS: transcriptional regulatory sequence 


II. Terms 


Unless otherwise noted, technical terms are used according 
to conventional usage. Definitions of common terms in 
molecular biology may be found in Benjamin Lewin, Genes 
VII, published by Oxford University Press, 2000 (ISBN 
019879276X); Kendrew et al. (eds.), The Encyclopedia of 
Molecular Biology, published by Blackwell Publishers, 1994 
(ISBN 0632021829); and Robert A. Meyers (ed.), Molecular 
Biology and Biotechnology: a Comprehensive Desk Refer- 
ence, published by Wiley, John & Sons, Inc., 1995 (ISBN 
0471186341); and other similar references. 

As used herein, the singular terms “a,” “an,” and “the” 
include plural referents unless context clearly indicates oth- 
erwise. Similarly, the word “or” is intended to include “and” 
unless the context clearly indicates otherwise. Also, as used 
herein, the term “comprises” means “includes.” Hence “com- 
prising A or B" means including A, B, or A and B. It is further 
to be understood that all nucleotide sizes or amino acid sizes, 
and all molecular weight or molecular mass values, given for 
nucleic acids or polypeptides are approximate, and are pro- 
vided for description. Although methods and materials simi- 
lar or equivalent to those described herein can be used in the 
practice or testing of the present invention, suitable methods 
and materials are described below. All publications, patent 
applications, patents, and other references mentioned herein 
are incorporated by reference in their entirety. In case of 
conflict, the present specification, including explanations of 
terms, will control. In addition, the materials, methods, and 
examples are illustrative only and not intended to be limiting. 

In order to facilitate review of the various embodiments of 
this disclosure, the following explanations of specific terms 
are provided: 

Adjuvant: A substance that non-specifically enhances the 
immune response to an antigen. Development of vaccine 
adjuvants for use in humans is reviewed in Singh et al. (Nat. 
Biotechnol. 17:1075-1081, 1999), which discloses that, at the 
time of its publication, aluminum salts and the MF59 micro- 
emulsion are the only vaccine adjuvants approved for human 
use. 

Amplification: Amplification of a nucleic acid molecule 
(e.g., a DNA or RNA molecule) refers to use of a laboratory 
technique that increases the number of copies of a nucleic 
acid molecule ina sample. An example of amplification is the 
polymerase chain reaction (PCR), in which a sample is con- 
tacted with a pair of oligonucleotide primers under conditions 
that allow for the hybridization of the primers to anucleic acid 
template in the sample. The primers are extended under suit- 
able conditions, dissociated from the template, re-annealed, 
extended, and dissociated to amplify the number of copies of 
the nucleic acid. The product of amplification can be charac- 
terized by such techniques as electrophoresis, restriction 
endonuclease cleavage patterns, oligonucleotide hybridiza- 
tion or ligation, and/or nucleic acid sequencing. 

Other examples of amplification methods include strand 
displacement amplification, as disclosed in U.S. Pat. No. 
5,744,311; transcription-free isothermal amplification, as 
disclosed in U.S. Pat. No. 6,033,881; repair chain reaction 
amplification, as disclosed in WO 90/01069; ligase chain 
reaction amplification, as disclosed in EP-A-320,308; gap 
filling ligase chain reaction amplification, as disclosed in U.S. 
Pat. No. 5,427,930; and NASBA™ RNA transcription-free 
amplification, as disclosed in U.S. Pat. No. 6,025,134. An 
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amplification method can be modified, including for example 
by additional steps or coupling the amplification with another 
protocol. 

Animal: Living multi-cellular vertebrate organisms, a cat- 
egory that includes, for example, mammals and birds. The 
term mammal includes both human and non-human mam- 
mals. Similarly, the term “subject” includes both human and 
veterinary subjects, for example, humans, non-human pri- 
mates, dogs, cats, horses, and cows. 

Antibody: A protein (or protein complex) that includes one 
or more polypeptides substantially encoded by immunoglo- 
bulin genes or fragments of immunoglobulin genes. The rec- 
ognized immunoglobulin genes include the kappa, lambda, 
alpha, gamma, delta, epsilon, and mu constant region genes, 
as well as the myriad immunoglobulin variable region genes. 
Light chains are classified as either kappa or lambda. Heavy 
chains are classified as gamma, mu, alpha, delta, or epsilon, 
which in turn define the immunoglobulin classes, IgG, IgM, 
IgA, IgD and IgE, respectively. 

The basic immunoglobulin (antibody) structural unit is 
generally a tetramer. Each tetramer is composed of two iden- 
tical pairs of polypeptide chains, each pair having one “light” 
(about 25 kDa) and one “heavy” (about 50-70 kDa) chain. 
The N-terminus of each chain defines a variable region of 
about 100 to 110 or more amino acids primarily responsible 
for antigen recognition. The terms “variable light chain" (V,) 
and “variable heavy chain" (V jj) refer, respectively, to these 
light and heavy chains 

As used herein, the term “antibodies” includes intact 
immunoglobulins as well as a number of well-characterized 
fragments. For instance, Fabs, Fvs, and single-chain Fvs 
(SCFvs) that bind to target protein (or epitope within a protein 
or fusion protein) would also be specific binding agents for 
that protein (or epitope). These antibody fragments are 
defined as follows: (1) Fab, the fragment which contains a 
monovalent antigen-binding fragment of an antibody mol- 
ecule produced by digestion of whole antibody with the 
enzyme papain to yield an intact light chain and a portion of 
one heavy chain; (2) Fab', the fragment of an antibody mol- 
ecule obtained by treating whole antibody with pepsin, fol- 
lowed by reduction, to yield an intact light chain and a portion 
of the heavy chain; two Fab' fragments are obtained per 
antibody molecule; (3) (Fab'),, the fragment of the antibody 
obtained by treating whole antibody with the enzyme pepsin 
without subsequent reduction; (4) F(ab'),, a dimer of two Fab' 
fragments held together by two disulfide bonds; (5) Fv, a 
genetically engineered fragment containing the variable 
region of the light chain and the variable region of the heavy 
chain expressed as two chains; and (6) single chain antibody, 
a genetically engineered molecule containing the variable 
region of the light chain, the variable region of the heavy 
chain, linked by a suitable polypeptide linker as a genetically 
fused single chain molecule. Methods of making these frag- 
ments are routine (see, for example, Harlow and Lane, Using 
Antibodies: A Laboratory Manual, CSHL, New York, 1999). 

Antibodies for use in the methods and devices of this 
disclosure can be monoclonal or polyclonal. Merely by way 
of example, monoclonal antibodies can be prepared from 
murine hybridomas according to the classical method of 
Kohler and Milstein (Nature 256:495-97, 1975) or derivative 
methods thereof. Detailed procedures for monoclonal anti- 
body production are described in Harlow and Lane, Using 
Antibodies: A Laboratory Manual, CSHL, New York, 1999. 

Antigen: À compound, composition, or substance that can 
stimulate the production of antibodies or a T-cell response in 
an animal, including compositions that are injected or 
absorbed into an animal. Àn antigen reacts with the products 
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of specific humoral or cellular immunitv, including those 
induced bv heterologous immunogens. In one embodiment, 
an antigen is a coronavirus antigen. 

Binding or Stable Binding: An oligonucleotide binds or 
stably binds to a target nucleic acid if a sufficient amount of 
the oligonucleotide forms base pairs or is hybridized to its 
target nucleic acid, to permit detection of that binding. Bind- 
ing can be detected by either physical or functional properties 
of the target:oligonucleotide complex. Binding between a 
target and an oligonucleotide can be detected by any proce- 
dure known to one skilled in the art, including functional or 
physical binding assays. Binding can be detected functionally 
by determining whether binding has an observable effect 
upon a biosynthetic process such as expression of a gene, 
DNA replication, transcription, translation, and the like. 

Physical methods of detecting the binding of complemen- 
tary strands of DNA or RNA are well known in the art, and 
include such methods as DNase I or chemical footprinting, 
gel shift and affinity cleavage assays, Northern blotting, 
Southern blotting, dot blotting, and light absorption detection 
procedures. For example, a method which is widely used, 
because it is so simple and reliable, involves observing a 
change in light absorption of a solution containing an oligo- 
nucleotide (or an analog) and a target nucleic acid at 220 to 
300 nm as the temperature is slowly increased. If the oligo- 
nucleotide or analog has bound to its target, there is a sudden 
increase in absorption at a characteristic temperature as the 
oligonucleotide (or analog) and target dissociate or melt. 

The binding between an oligomer and its target nucleic 
acid is frequently characterized by the temperature (T,,) at 
which 50% of the oligomer is melted from its target. A higher 
T,, means a stronger or more stable complex relative to a 
complex with a lower T,,. 

cDNA (complementary DNA): A piece of DNA lacking 
internal, non-coding segments (introns) and regulatory 
sequences that determine transcription. cDNA is synthesized 
in the laboratory by reverse transcription from messenger 
RNA extracted from cells. 

Electrophoresis: Electrophoresis refers to the migration of 
charged solutes or particles in a liquid medium under the 
influence of an electric field. Electrophoretic separations are 
widely used for analysis of macromolecules. Of particular 
importance is the identification of proteins and nucleic acid 
sequences. Such separations can be based on differences in 
size and/or charge. Nucleotide sequences have a uniform 
charge and are therefore separated based on differences in 
size. Electrophoresis can be performed in an unsupported 
liquid medium (for example, capillary electrophoresis), but 
more commonly the liquid medium travels through a solid 
supporting medium. The most widely used supporting media 
are gels, for example, polyacrylamide and agarose gels. 

Sieving gels (for example, agarose) impede the flow of 
molecules. The pore size of the gel determines the size of a 
molecule that can flow freely through the gel. The amount of 
time to travel through the gel increases as the size of the 
molecule increases. As a result, small molecules travel 
through the gel more quickly than large molecules and thus 
progress further from the sample application area than larger 
molecules, in a given time period. Such gels are used for 
size-based separations of nucleotide sequences. 

Fragments of linear DNA migrate through agarose gels 
with a mobility that is inversely proportional to the log,, of 
their molecular weight. By using gels with different concen- 
trations of agarose, different sizes of DNA fragments can be 
resolved. Higher concentrations of agarose facilitate separa- 
tion of small DNAs, while low agarose concentrations allow 
resolution of larger DNAs. 
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Hybridization: Oligonucleotides and their analogs hybrid- 
ize by hydrogen bonding, which includes Watson-Crick, 
Hoogsteen or reversed Hoogsteen hydrogen bonding, 
between complementary bases. Generally, nucleic acid con- 
sists of nitrogenous bases that are either pyrimidines (cy- 
tosine (C), uracil (U), and thymine (T)) or purines (adenine 
(A) and guanine (G)). These nitrogenous bases form hydro- 
gen bonds between a pyrimidine and a purine, and the bond- 
ing of the pyrimidine to the purine is referred to as “base 
pairing.” More specifically, A will hydrogen bond to T or U, 
and G will bond to C. “Complementary” refers to the base 
pairing that occurs between two distinct nucleic acid 
sequences or two distinct regions of the same nucleic acid 
sequence. 


“Specifically hybridizable” and “specifically complemen- 
tary” are terms that indicate a sufficient degree of comple- 
mentarity such that stable and specific binding occurs 
between the oligonucleotide (or its analog) and the DNA or 
RNA target. The oligonucleotide or oligonucleotide analog 
need not be 100% complementary to its target sequence to be 
specifically hybridizable. An oligonucleotide or analog is 
specifically hybridizable when binding of the oligonucleotide 
or analog to the target DNA or RNA molecule interferes with 
the normal function of the target DNA or RNA, and there is a 
sufficient degree of complementarity to avoid non-specific 
binding of the oligonucleotide or analog to non-target 
sequences under conditions where specific binding is desired, 
for example under physiological conditions in the case of in 
vivo assays or systems. Such binding is referred to as specific 
hybridization. 

Hybridization conditions resulting in particular degrees of 
stringency will vary depending upon the nature of the hybrid- 
ization method of choice and the composition and length of 
the hybridizing nucleic acid sequences. Generally, the tem- 
perature of hybridization and the ionic strength (especially 
the Na* and/or Mg** concentration) of the hybridization 
buffer will determine the stringency of hybridization, though 
wash times also influence stringency. Calculations regarding 
hybridization conditions required for attaining particular 
degrees of stringency are discussed by Sambrook et al. (ed.), 
Molecular Cloning: A Laboratory Manual, 2" ed., vol. 1-3, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
N.Y., 1989, chapters 9 and 11; and Ausubel et al. Short Pro- 
tocols in Molecular Biology, 4" ed., John Wiley & Sons, Inc., 
1999. 


For purposes of the present disclosure, “stringent condi- 
tions" encompass conditions under which hybridization will 
only occur if there is less than 25% mismatch between the 
hybridization molecule and the target sequence. “Stringent 
conditions" may be broken down into particular levels of 
stringency for more precise definition. Thus, as used herein, 
"moderate stringency" conditions are those under which mol- 
ecules with more than 25% sequence mismatch will not 
hybridize; conditions of “medium stringency" are those 
under which molecules with more than 15% mismatch will 
not hybridize, and conditions of “high stringency" are those 
under which sequences with more than 1096 mismatch will 
not hybridize. Conditions of *very high stringency" are those 
under which sequences with more than 6% mismatch will not 
hybridize. 

Immune Stimulatory Composition: À term used herein to 
mean a composition useful for stimulating or eliciting a spe- 
cific immune response (or immunogenic response) in a ver- 
tebrate. In some embodiments, the immunogenic response is 
protective or provides protective immunity, in that it enables 
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the vertebrate animal to better resist infection with or disease 
progression from the organism against which the vaccine is 
directed. 

Without wishing to be bound by a specific theory, it is 
believed that an immunogenic response may arise from the 
generation of an antibody specific to one or more of the 
epitopes provided in the immune stimulatory composition. 
Alternatively, the response may comprise a T-helper or cyto- 
toxic cell-based response to one or more of the epitopes 
provided in the immune stimulatory composition. All three of 
these responses may originate from naive or memory cells. 
One specific example of a type of immune stimulatory com- 
position is a vaccine. 

In some embodiments, an “effective amount" or *immune- 
stimulatory amount" of an immune stimulatory composition 
is an amount which, when administered to a subject, is suffi- 
cient to engender a detectable immune response. Such a 
response may comprise, for instance, generation of an anti- 
body specific to one or more of the epitopes provided in the 
immune stimulatory composition. Alternatively, the response 
may comprise a T-helper or CTL-based response to one or 
more of the epitopes provided in the immune stimulatory 
composition. All three of these responses may originate from 
naive or memory cells. In other embodiments, a “protective 
effective amount" of an immune stimulatory composition is 
anamount which, when administered to a subject, is sufficient 
to confer protective immunity upon the subject. 

Inhibiting or Treating a Disease: Inhibiting the full devel- 
opment of a disease or condition, for example, in a subject 
who is at risk for a disease such as SARS. "Treatment" refers 
to a therapeutic intervention that ameliorates a sign or symp- 
tom ofa disease or pathological condition after it has begun to 
develop. As used herein, the term “ameliorating,” with refer- 
ence to a disease, pathological condition or symptom, refers 
to any observable beneficial effect ofthe treatment. The ben- 
eficial effect can be evidenced, for example, by a delayed 
onset of clinical symptoms of the disease in a susceptible 
subject, a reduction in severity of some or all clinical symp- 
toms of the disease, a slower progression of the disease, a 
reduction in the number of relapses of the disease, an 
improvement in the overall health or well-being of the sub- 
ject, or by other parameters well known in the art that are 
specific to the particular disease. 

Isolated: An “isolated” microorganism (such as a virus, 
bacterium, fungus, or protozoan) has been substantially sepa- 
rated or purified away from microorganisms of different 
types, strains, or species. Microorganisms can be isolated by 
a variety of techniques, including serial dilution and cultur- 
ing. 

An “isolated” biological component (such as a nucleic acid 
molecule, protein or organelle) has been substantially sepa- 
rated or purified away from other biological components in 
the cell of the organism in which the component naturally 
Occurs, such as other chromosomal and extra-chromosomal 
DNA and RNA, proteins, and organelles. Nucleic acids and 
proteins that have been “isolated” include nucleic acids and 
proteins purified by standard purification methods. The term 
also embraces nucleic acids and proteins prepared by recom- 
binant expression in a host cell, as well as chemically synthe- 
sized nucleic acids or proteins, or fragments thereof. 

Label: A detectable compound or composition that is con- 
jugated directly or indirectly to another molecule to facilitate 
detection of that molecule. Specific, non-limiting examples 
of labels include fluorescent tags, enzymatic linkages, and 
radioactive isotopes. 

Nucleic Acid Molecule: A polymeric form of nucleotides, 
which may include both sense and anti-sense strands of RNA, 
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cDNA, genomic DNA, and synthetic forms and mixed poly- 
mers of the above. A nucleotide refers to a ribonucleotide, 
deoxynucleotide or a modified form of either type of nucle- 
otide. A “nucleic acid molecule” as used herein is synony- 
mous with “nucleic acid” and “polynucleotide.” A nucleic 
acid molecule is usually at least 10 bases in length, unless 
otherwise specified. The term includes single- and double- 
stranded forms of DNA. A polynucleotide may include either 
or both naturally occurring and modified nucleotides linked 
together by naturally occurring and/or non-naturally occur- 
ring nucleotide linkages. 

Oligonucleotide: A nucleic acid molecule generally com- 
prising a length of 300 bases or fewer. The term often refers to 
single-stranded deoxyribonucleotides, but it can refer as well 
to single- or double-stranded ribonucleotides, RNA:DNA 
hybrids and double-stranded DNAs, among others. The term 
“oligonucleotide” also includes oligonucleosides (that is, an 
oligonucleotide minus the phosphate) and any other organic 
base polymer. In some examples, oligonucleotides are about 
10 to about 90 bases in length, for example, 12, 13, 14, 15, 16, 
17, 18, 19 or 20 bases in length. Other oligonucleotides are 
about 25, about 30, about 35, about 40, about 45, about 50, 
about 55, about 60 bases, about 65 bases, about 70 bases, 
about 75 bases or about 80 bases in length. Oligonucleotides 
may be single-stranded, for example, for use as probes or 
primers, or may be double-stranded, for example, for use in 
the construction of a mutant gene. Oligonucleotides can be 
either sense or anti-sense oligonucleotides. An oligonucle- 
otide can be modified as discussed above in reference to 
nucleic acid molecules. Oligonucleotides can be obtained 
from existing nucleic acid sources (for example, genomic or 
cDNA), but can also be synthetic (for example, produced by 
laboratory or in vitro oligonucleotide synthesis). 

Open Reading Frame (ORF): A series of nucleotide triplets 
(codons) coding for amino acids without any internal termi- 
nation codons. These sequences are usually translatable into 
a peptide/polypeptide/protein/polyprotein. 

Operably Linked: A first nucleic acid sequence is operably 
linked with a second nucleic acid sequence when the first 
nucleic acid sequence is placed in a functional relationship 
with the second nucleic acid sequence. For instance, a pro- 
moter is operably linked to a coding sequence if the promoter 
affects the transcription or expression of the coding sequence. 
Generally, operably linked DNA sequences are contiguous 
and, where necessary to join two protein coding regions, in 
the same reading frame. If introns are present, the operably 
linked DNA sequences may not be contiguous. 

Pharmaceutically Acceptable Carriers: The pharmaceuti- 
cally acceptable carriers useful in this disclosure are conven- 
tional. Remington’s Pharmaceutical Sciences, by E. W. Mar- 
tin, Mack Publishing Co., Easton, Pa., 15th Edition (1975), 
describes compositions and formulations suitable for phar- 
maceutical delivery of one or more therapeutic compounds or 
molecules, such as one or more SARS-CoV nucleic acid 
molecules, proteins or antibodies that bind these proteins, and 
additional pharmaceutical agents. 

In general, the nature of the carrier will depend on the 
particular mode of administration being employed. For 
instance, parenteral formulations usually comprise injectable 
fluids that include pharmaceutically and physiologically 
acceptable fluids such as water, physiological saline, bal- 
anced salt solutions, aqueous dextrose, glycerol or the like as 
a vehicle. For solid compositions (for example, powder, pill, 
tablet, or capsule forms), conventional non-toxic solid carri- 
ers can include, for example, pharmaceutical grades of man- 
nitol, lactose, starch, or magnesium stearate. In addition to 
biologically-neutral carriers, pharmaceutical compositions to 
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be administered can contain minor amounts of non-toxic 
auxiliarv substances, such as wetting or emulsifving agents, 
preservatives, and pH buffering agents and the like, for 
example sodium acetate or sorbitan monolaurate. 

Polvpeptide: A polvmer in which the monomers are amino 
acid residues which are joined together through amide bonds. 
When the amino acids are alpha-amino acids, either the L-op- 
tical isomer or the D-optical isomer can be used, the L-iso- 
mers being preferred. The terms “polypeptide” or “protein” 
as used herein are intended to encompass any amino acid 
sequence and include modified sequences such as glycopro- 
teins. The term “polypeptide” is specifically intended to cover 
naturally occurring proteins, as well as those which are 
recombinantly or synthetically produced. 

Conservative amino acid substitutions are those substitu- 
tions that, when made, least interfere with the properties of 
the original protein, that is, the structure and especially the 
function of the protein is conserved and not significantly 
changed by such substitutions. Examples of conservative 
substitutions are shown below. 


Original Residue Conservative Substitutions 


Ala Ser 

Arg Lys 

Asn Gln, His 

Asp Glu 

Cys Ser 

Gin Asn 

Glu Asp 

His Asn; Gln 

Ile Leu, Val 

Leu Ile; Val 

Lys Arg; Gln; Glu 
Met Leu; Ile 

Phe Met; Leu; Tyr 
Ser Thr 

Thr Ser 

Trp Tyr 

Tyr Trp; Phe 

Val Ile; Leu 


Conservative substitutions generally maintain (a) the struc- 
ture of the polypeptide backbone in the area of the substitu- 
tion, for example, as a sheet or helical conformation, (b) the 
charge or hydrophobicity of the molecule at the target site, or 
(c) the bulk of the side chain. 

The substitutions which in general are expected to produce 
the greatest changes in protein properties will be non-conser- 
vative, for instance changes in which (a) a hydrophilic resi- 
due, for example, sery] or threonyl, is substituted for (or by) a 
hydrophobic residue, for example, leucyl, isoleucyl, pheny- 
lalanyl, valyl or alanyl; (b) a cysteine or proline is substituted 
for (or by) any other residue; (c) a residue having an electrop- 
ositive side chain, for example, lysyl, arginyl, or histadyl, is 
substituted for (or by) an electronegative residue, for 
example, glutamyl or aspartyl; or (d) a residue having a bulky 
side chain, for example, phenylalanine, is substituted for (or 
by) one not having a side chain, for example, glycine. 


Probes and Primers: A probe comprises an isolated nucleic 
acid attached to a detectable label or other reporter molecule. 
Typical labels include radioactive isotopes, enzyme sub- 
strates, co-factors, ligands, chemiluminescent or fluorescent 
agents, haptens, and enzymes. Methods for labeling and guid- 
ance in the choice of labels appropriate for various purposes 
are discussed, for example, in Sambrook et al. (ed.), Molecu- 
lar Cloning: A Laboratory Manual, 2"? ed., vol. 1-3, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 
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1989 and Ausubel et al. Short Protocols in Molecular Biol- 
ogy, 4” ed., John Wiley & Sons, Inc., 1999. 

Primers are short nucleic acid molecules, for instance DNA 
oligonucleotides 10 nucleotides or more in length, for 
example that hybridize to contiguous complementary nucle- 
otides ora sequence to be amplified. Longer DNA oligonucle- 
otides may be about 15, 20, 25, 30 or 50 nucleotides or more 
in length. Primers can be annealed to a complementary target 
DNA strand by nucleic acid hybridization to form a hybrid 
between the primer and the target DNA strand, and then the 
primer extended along the target DNA strand by a DNA 
polymerase enzyme. Primer pairs can be used for amplifica- 
tion of a nucleic acid sequence, for example, by the PCR or 
other nucleic-acid amplification methods known in the art, as 
described above. 

Methods for preparing and using nucleic acid probes and 
primers are described, for example, in Sambrook et al. (ed.), 
Molecular Cloning: A Laboratory Manual, 274 ed., vol. 1-3, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
N.Y., 1989; Ausubel et al. Short Protocols in Molecular Biol- 
ogy, 4" ed., John Wiley & Sons, Inc., 1999; and Innis et al. 
PCR Protocols, A Guide to Methods and Applications, Aca- 
demic Press, Inc., San Diego, Calif., 1990. Amplification 
primer pairs can be derived from a known sequence, for 
example, by using computer programs intended for that pur- 
pose such as Primer (Version 0.5, © 1991, Whitehead Insti- 
tute for Biomedical Research, Cambridge, Mass.). One of 
ordinary skill in the art will appreciate that the specificity of a 
particular probe or primer increases with its length. Thus, in 
order to obtain greater specificity, probes and primers can be 
selected that comprise at least 20, 25, 30, 35, 40, 45, 50 or 
more consecutive nucleotides ofa target nucleotide sequence. 

Protein: A biological molecule, particularly a polypeptide, 
expressed by a gene and comprised of amino acids. A 
“polyprotein” is a protein that, after synthesis, is cleaved to 
produce several functionally distinct polypeptides. 

Purified: The term “purified” does not require absolute 
purity; rather, it is intended as a relative term. Thus, for 
example, a purified protein preparation is one in which the 
subject protein is more pure than in its natural environment 
within a cell. Generally, a protein preparation is purified such 
that the protein represents at least 50% of the total protein 
content of the preparation. 

Recombinant Nucleic Acid: A sequence that is not natu- 
rally occurring or has a sequence that is made by an artificial 
combination of two otherwise separated segments of 
sequence. This artificial combination is often accomplished 
by chemical synthesis or, more commonly, by the artificial 
manipulation of isolated segments of nucleic acids, for 
example, by genetic engineering techniques such as those 
described in Sambrook et al. (ed.), Molecular Cloning: A 
Laboratory Manual, 2" ed., vol. 1-3, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.X., 1989. The term 
recombinant includes nucleic acids that have been altered 
solely by addition, substitution, or deletion ofa portion of the 
nucleic acid. 

Sample: A portion, piece, or segment that is representative 
of a whole. This term encompasses any material, including 
for instance samples obtained from an animal, a plant, or the 
environment. 

An “environmental sample” includes a sample obtained 
from inanimate objects or reservoirs within an indoor or 
outdoor environment. Environmental samples include, but 
are not limited to: soil, water, dust, and air samples; bulk 
samples, including building materials, furniture, and landfill 
contents; and other reservoir samples, such as animal refuse, 
harvested grains, and foodstuffs. 
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A “biological sample” is a sample obtained from a plant or 
animal subject. As used herein, biological samples include all 
samples useful for detection of viral infection in subjects, 
including, but not limited to: cells, tissues, and bodily fluids, 
such as blood; derivatives and fractions of blood (such as 
serum); extracted galls; biopsied or surgically removed tis- 
sue, including tissues that are, for example, unfixed, frozen, 
fixed in formalin and/or embedded in paraffin; tears; milk; 
skin scrapes; surface washings; urine; sputum; cerebrospinal 
fluid; prostate fluid; pus; bone marrow aspirates; BAL; saliva; 
cervical swabs; vaginal swabs; and oropharyngeal wash. 

Sequence Identity The similarity between two nucleic acid 
sequences, or two amino acid sequences, is expressed in 
terms of the similarity between the sequences, otherwise 
referred to as sequence identity. Sequence identity is fre- 
quently measured in terms of percentage identity (or similar- 
ity or homology); the higher the percentage, the more similar 
the two sequences are. 

Methods of alignment of sequences for comparison are 
well known in the art. Various programs and alignment algo- 
rithms are described in: Smith and Waterman (Adv. Appl. 
Math., 2:482, 1981); Needleman and Wunsch (J. Mol. Biol., 
48:443, 1970); Pearson and Lipman (Proc. Natl. Acad. Sci., 
85:2444, 1988); Higgins and Sharp (Gene, 73:237-44, 1988); 
Higgins and Sharp (CABIOS, 5:151-53, 1989); Corpet et al. 
(Nuc. Acids Res., 16:10881-90, 1988); Huang et al. (Comp. 
Appls Biosci., 8:155-65, 1992); and Pearson et al. (Meth. Mol. 
Biol., 24:307-31, 1994). Altschul et al. (Nature Genet., 6:119- 
29, 1994) presents a detailed consideration of sequence align- 
ment methods and homology calculations. 

The alignment tools ALIGN (Myers and Miller, CABIOS 
4:11-17, 1989) or LFASTA (Pearson and Lipman, 1988) may 
be used to perform sequence comparisons (Internet Pro- 
gram© 1996, W. R. Pearson and the University of Virginia, 
“fasta20u63” version 2.0u63, release date December 1996). 
ALIGN compares entire sequences against one another, 
while LFASTA compares regions of local similarity. These 
alignment tools and their respective tutorials are available on 
the Internet at the NCSA website. Alternatively, for compari- 
sons of amino acid sequences of greater than about 30 amino 
acids, the “Blast 2 sequences” function can be employed 
using the default BLOSUM62 matrix set to default param- 
eters, (gap existence cost of 11, and a per residue gap cost of 
1). When aligning short peptides (fewer than around 30 amino 
acids), the alignment should be performed using the “Blast 2 
sequences” function, employing the PAM30 matrix set to 
default parameters (open gap 9, extension gap 1 penalties). 
The BLAST sequence comparison system is available, for 
instance, from the NCBI web site; see also Altschul et al., /. 
Mol. Biol., 215:403-10, 1990; Gish and States, Nature Genet., 
3:266-72, 1993; Madden et al., Meth. Enzvmol., 266:131-41, 
1996; Altschul et al., Nucleic Acids Res., 25:3389-402, 1997; 
and Zhang and Madden, Genome Res., 7:649-56, 1997. 

Orthologs (equivalent to proteins of other species) of pro- 
teins are in some instances characterized by possession of 
greater than 75% sequence identity counted over the full- 
length alignment with the amino acid sequence of specific 
protein using ALIGN set to default parameters. Proteins with 
even greater similarity to a reference sequence will show 
increasing percentage identities when assessed by this 
method, such as at least 80%, at least 85%, at least 90%, at 
least 92%, at least 95%, or at least 98% sequence identity. In 
addition, sequence identity can be compared over the full 
length of one or both binding domains of the disclosed fusion 
proteins. 

When significantly less than the entire sequence is being 
compared for sequence identity, homologous sequences will 
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typically possess at least 80% sequence identity over short 
windows of 10-20, and may possess sequence identities of at 
least 85%, at least 90%, at least 95%, or at least 99% depend- 
ing on their similarity to the reference sequence. Sequence 
identity over such short windows can be determined using 
LFASTA; methods are described at the NCSA website. One 
of skill in the art will appreciate that these sequence identity 
ranges are provided for guidance only; it is entirely possible 
that strongly significant homologs could be obtained that fall 
outside of the ranges provided. Similar homology concepts 
apply for nucleic acids as are described for protein. 

An alternative indication that two nucleic acid molecules 
are closely related is that the two molecules hybridize to each 
other under stringent conditions. Representative hybridiza- 
tion conditions are discussed above. 

Nucleic acid sequences that do not show a high degree of 
identity may nevertheless encode similar amino acid 
sequences, due to the degeneracy of the genetic code. It is 
understood that changes in nucleic acid sequence can be 
made using this degeneracy to produce multiple nucleic acid 
sequences that each encode substantially the same protein. 

Specific Binding Agent: An agent that binds substantially 
only to a defined target. Thus a protein-specific binding agent 
binds substantially only the defined protein, or to a specific 
region within the protein. As used herein, a protein-specific 
binding agent includes antibodies and other agents that bind 
substantially to a specified polypeptide. The antibodies may 
be monoclonal or polyclonal antibodies that are specific for 
the polypeptide, as well as immunologically effective por- 
tions (“fragments”) thereof. 

The determination that a particular agent binds substan- 
tially only to a specific polypeptide may readily be made by 
using or adapting routine procedures. One suitable in vitro 
assay makes use of the Western blotting procedure (described 
in many standard texts, including Harlow and Lane, Using 
Antibodies: A Laboratory Manual, CSHL, New York, 1999). 

Transformed: A “transformed” cell is a cell into which has 
been introduced a nucleic acid molecule by molecular biol- 
ogy techniques. The term encompasses all techniques by 
which a nucleic acid molecule might be introduced into such 
a cell, including transfection with viral vectors, transforma- 
tion with plasmid vectors, and introduction of naked DNA by 
electroporation, lipofection, and particle gun acceleration. 

Vector: A nucleic acid molecule as introduced into a host 
cell, thereby producing a transformed host cell. A vector may 
include nucleic acid sequences that permit it to replicate in a 
host cell, such as an origin of replication. A vector may also 
include one or more selectable marker genes and other 
genetic elements known in the art. 

Virus: Microscopic infectious organism that reproduces 
inside living cells. A virus typically consists essentially of a 
core of a single nucleic acid surrounded by a protein coat, and 
has the ability to replicate only inside a living cell. “Viral 
replication” is the production of additional virus by the occur- 
rence of at least one viral life cycle. A virus may subvert the 
host cells’ normal functions, causing the cell to behave in a 
manner determined by the virus. For example, a viral infec- 
tion may result in a cell producing a cytokine, or responding 
to a cytokine, when the uninfected cell does not normally do 
SO. 

“Coronaviruses” are large, enveloped, RNA viruses that 
cause respiratory and enteric diseases in humans and other 
animals. Coronavirus genomes are non-segmented, single- 
stranded, positive-sense RNA, approximately 27-31 kb in 
length. Genomes have a 5' methylated cap and 3' poly-A tail, 
and function directly as mRNA. Host cell entry occurs via 
endocytosis and membrane fusion, and replication occurs in 
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the cytoplasm. Initially, the 5' 20 kb of the positive-sense 
genome is translated to produce a viral polymerase, which 
then produces a full-length negative-sense strand used as a 
template to produce subgenomic mRNA as a “nested set” of 
transcripts. Assembly occurs by budding into the golgi appa- 
ratus, and particles are transported to the surface of the cell 
and released. 


III. Overview of Several Embodiments 


A newly isolated human coronavirus (SARS-CoV) is dis- 
closed herein. The entire genomic nucleic acid sequence of 
this virus is also provided herein. Also disclosed are the 
nucleic acid sequences of the SARS-CoV ORFs, and the 
polypeptide sequences encoded by these ORFs. Pharmaceu- 
tical andimmune stimulatory compositions are also disclosed 
that include one or more SARS-CoV viral nucleic acids, 
polypeptides encoded by these viral nucleic acids and anti- 
bodies that bind to a SARS-CoV polypeptide or SARS-CoV 
polypeptide fragment. 

In one embodiment, a method is provided for detecting the 
presence of SARS-CoV in a sample. This method includes 
contacting the sample with a pair of nucleic acid primers that 
hybridize to a SARS-CoV nucleic acid, wherein at least one 
primer is 5'-end labeled with a reporter dye, amplifying the 
SARS-CoV nucleic acid or a fragment thereof from the 
sample utilizing the pair of nucleic acid primers, electro- 
phoresing the amplified products, and detecting the 5'-end 
labeled reporter dye, thereby detecting a SARS-CoV. In a 
specific, non-limiting example, the amplification utilizes RT- 
PCR. Ina further specific example of the provided method, at 
least one of the nucleic acid primers that hybridize to a SARS- 
CoV nucleic acid includes a sequence as set forth in any one 
of SEQ ID NOs: 13-15. 

In another example of the provided method, detecting the 
presence of SARS-CoV in a sample includes contacting the 
sample with a pair of nucleic acid primers that hybridize to a 
SARS-CoV nucleic acid, amplifying the SARS-CoV nucleic 
acid ora fragment thereof from the sample utilizing the pair of 
nucleic acid primers, adding to the amplified SARS-CoV 
nucleic acid or the fragment thereof a TaqMan SARS-CoV 
probe that hybridizes to the SARS-CoV nucleic acid, wherein 
the TaqMan SARS-CoV probe is labeled with a 5'-reporter 
dye and a 3'-quencher dye, performing one or more additional 
rounds of amplification, and detecting fluorescence of the 
5'-reporter dye, thereby detecting a SARS-CoV. Ina specific, 
non-limiting example, the amplification utilizes RT-PCR. Ina 
further specific example of the provided method, at least one 
of the nucleic acid primers that hybridize to a SARS-CoV 
nucleic acid and/or the TaqMan SARS-CoV probe that 
hybridizes to the SARS-CoV nucleic acid includes a 
sequence as set forth in any one of SEQ ID NOs: 16-33. 

In another embodiment, a method is provided for detecting 
a SARS-CoV ina biological sample that contains antibodies. 
This method includes contacting the biological sample with a 
SARS-CoV-specific antigen, wherein the antigen includes a 
SARS-CoV organism and determining whether a binding 
reaction occurs between the SARS-CoV-specific antigen and 
an antibody in the biological sample if such is present, 
thereby detecting SARS-CoV. 

In a further embodiment, a method is provided for detect- 
ing a SARS-CoV in a biological sample that contains 
polypeptides and/or fragments thereof. This method includes 
contacting the biological sample with a SARS-CoV-specific 
antibody and determining whether a binding reaction occurs 
between the SARS-CoV-specific antibody and a SARS-CoV 
polypeptide or fragment thereof in the biological sample if 
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such is present, thereby detecting SARS-CoV. In a specific, 
non-limiting example, determining whether a binding reac- 
tion occurs between the SARS-CoV-specific antibody and a 
SARS-CoV polypeptide or fragment thereof is carried out in 
situ or in a tissue sample. In a further specific example, 
determining whether a binding reaction occurs between the 
SARS-CoV-specific antibody and a SARS-CoV polypeptide 
or fragment thereof includes an immunohistochemical assay. 

An additional embodiment includes a kit for detecting a 
SARS-CoV in a sample, including a pair of nucleic acid 
primers that hybridize under stringent conditions to a SARS- 
CoV nucleic acid, wherein one primer is 5'-end labeled with 
areporter dye. Ina specific, non-limiting example, at least one 
of the nucleic acid primers that hybridize to a SARS-CoV 
nucleic acid includes a sequence as set forth in any one of 
SEQ ID NOs: 13-15. 

Another example of the provided kit includes a pair of 
nucleic acid primers that hybridize under high stringency 
conditions to a SARS-CoV nucleic acid and a TaqMan 
SARS-CoV probe that hybridizes to the SARS-CoV nucleic 
acid, wherein the TaqMan SARS-CoV probe is labeled with a 
5'-reporter dye and a 3'-quencher dye. In a specific, non- 
limiting example, at least one of the nucleic acid primers that 
hybridize to a SARS-CoV nucleic acid and/or the TaqMan 
SARS-CoV probe that hybridizes to the SARS-CoV nucleic 
acid includes a sequence as set forth in any one of SEQ ID 
NOs: 16-33. 

Also disclosed herein is a composition including an iso- 
lated SARS-CoV organism. In one embodiment, the isolated 
SARS-CoV organism is an inactive isolated SARS-CoV 
organism. In another embodiment, the composition includes 
at least one component selected from the group consisting of 
pharmaceutically acceptable carriers, adjuvants and combi- 
nations of two or more thereof. In yet another embodiment, 
the composition is introduced into a subject, thereby eliciting 
an immune response against a SARS-CoV antigenic epitope 
in a subject. 


IV. SARS-CoV Nucleotide and Amino Acid 
Sequences 


The current disclosure provides an isolated SARS-CoV 
genome, isolated SARS-CoV polypeptides, and isolated 
nucleic acid molecules encoding the same. In one embodi- 
ment, the isolated SARS-CoV genome has a sequence as 
shown in SEQ ID NO: 1 or an equivalent thereof. Polynucle- 
otides encoding a SARS-CoV polypeptide (encoded by an 
ORF from within the genome) are also provided, and are 
termed SARS-CoV nucleic acid molecules. These nucleic 
acid molecules include DNA, cDNA and RNA sequences 
which encode a SARS-CoV polypeptide. Specific, non-lim- 
iting examples ofa SARS-CoV nucleic acid molecule encod- 
ing an ORF are nucleic acid 265 to nucleic acid 13,398 of 
SEQ ID NO: 1 (encoding SARS-CoV 1a, SEQ ID NO: 2), 
nucleic acid 13,398 to 21,482 of SEQ ID NO: 1 (encoding 
SARS-CoV 1b, SEQ ID NO: 3), nucleic acid 21,492 to 
25,256 of SEQ ID NO: 1 (encoding SARS-CoV S, SEQ ID 
NO: 4), nucleic acid 25,268 to 26,089 of SEQ ID NO: 1 
(encoding SARS-CoV X1, SEQ ID NO: 5), nucleic acid 
25,689 to 26,150 of SEQ ID NO: 1 (encoding SARS-CoV X2, 
SEQ ID NO: 6), nucleic acid 26,117 to 26,344 of SEQ ID NO: 
1 (encoding SARS-CoV E, SEQ ID NO: 7), nucleic acid 
26,398 to 27,060 of SEQ ID NO: 1 (encoding SARS-CoV M, 
SEQ ID NO: 8), nucleic acid 27,074 to 27,262 of SEQ ID NO: 
1 (encoding SARS-CoV X3, SEQ ID NO: 9), nucleic acid 
27,273 to 27,638 of SEQ ID NO: 1 (encoding SARS-CoV XA, 
SEQ ID NO: 10), nucleic acid 27,864 to 28,115 of SEQ ID 
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NO: 1 (encoding SARS-CoV X5, SEQ ID NO: 11), and 
nucleic acid 28,120 to 29,385 of SEQ ID NO: 1 (encoding 
SARS-CoV N, SEQ ID NO: 12). 


Oligonucleotide primers and probes derived from the 
SARS-CoV genome (SEQ ID NO: 1) are also encompassed 
within the scope of the present disclosure. Such oligonucle- 
otide primers and probes may comprise a sequence of at least 
about 15 consecutive nucleotides of the SARS-CoV nucleic 
acid sequence, such as at least about 20, 25, 30, 35, 40, 45, or 
50 or more consecutive nucleotides. These primers and 
probes may be obtained from any region of the disclosed 
SARS-CoV genome (SEQ ID NO: 1), including particularly 
from any of the ORFs disclosed herein. Specific, non-limiting 
examples of oligonucleotide primers derived from the SARS- 
CoV genome (SEQ ID NO: 1) include: Cor-p-F2 (SEQ ID 
NO: 13), Cor-p-F3 (SEQ ID NO: 14), Cor-p-R1 (SEQ ID NO: 
15), SARS1-F (SEQ ID NO: 16), SARSI—R (SEQ ID NO: 
17), SARS2-F (SEQ ID NO: 19), SARS2—R (SEQ ID NO: 
20), SARS3-F (SEQ ID NO: 22), SARS3—R (SEQ ID NO: 
23), N3-F (SEQ ID NO: 25), N3-R (SEQ ID NO: 26), 
3'NTR-F (SEQ ID NO: 28), 3'NTR-R (SEQ ID NO: 29), M-F 
(SEQ ID NO: 31), and M-R (SEQ ID NO: 32). Specific, 
non-limiting examples of oligonucleotide probes derived 
from the SARS-CoV genome (SEQ ID NO: 1) include: 
SARSI-P (SEQ ID NO: 18), SARS2-P (SEQ ID NO: 21), 
SARS3-P (SEQ ID NO: 24), N3-P (SEQ ID NO: 27), 
3'NTR-P (SEQ ID NO: 30), and M-P (SEQ ID NO: 33). 

Nucleic acid molecules encoding a SARS-CoV polypep- 
tide can be operatively linked to regulatory sequences or 
elements. Regulatory sequences or elements include, but are 
not limited to promoters, enhancers, transcription termina- 
tors, a start codon (for example, ATG), stop codons, and the 
like. 


Additionally, nucleic acid molecules encoding a SARS- 
CoV polypeptide can be inserted into an expression vector. 
Specific, non-limiting examples of vectors include, plasmids, 
bacteriophages, cosmids, animal viruses and yeast artificial 
chromosomes (YACs) (Burke et al., Science 236:806-12, 
1987). Such vectors may then be introduced into a variety of 
hosts including somatic cells, and simple or complex organ- 
isms, such as bacteria, fungi (Timberlake and Marshall, Sci- 
ence 244:1313-17, 1989), invertebrates, plants (Gasser et al., 
Plant Cell 1:15-24, 1989), and animals (Pursel et al., Science 
244:1281-88, 1989). 


Transformation of a host cell with an expression vector 
carrying a nucleic acid molecule encoding a SARS-CoV 
polypeptide may be carried out by conventional techniques, 
as are well known to those skilled in the art. By way of 
example, where the host is prokaryotic, such as Æ. coli, com- 
petent cells that are capable of DNA uptake can be prepared 
from cells harvested after exponential growth phase and sub- 
sequently treated by the CaCl, method using procedures well 
known in the art. Alternatively, MgCl, or RbCI can be used. 
Transformation can also be performed after forming a proto- 
plast of the host cell if desired, or by electroporation. 


When the host is a eukaryote, methods of transfection of 
DNA, such as calcium phosphate coprecipitates, conven- 
tional mechanical procedures such as microinjection, elec- 
troporation, insertion of a plasmid encased in liposomes, or 
virus vectors, may be used. Eukaryotic cells can also be 
cotransformed with SARS-CoV nucleic acid molecules, and 
a second foreign DNA molecule encoding a selectable phe- 
notype, such as the herpes simplex thymidine kinase gene. 
Another method is to use a eukaryotic viral vector, such as 
simian virus 40 or bovine papilloma virus, to transiently 
infect or transform eukaryotic cells and express the protein 
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(see, for example, Eukaryotic Viral Vectors, Cold Spring Har- 
bor Laboratory, Gluzman ed., 1982). 

The SARS-CoV polypeptides of this disclosure include 
proteins encoded by any of the ORFs disclosed herein, and 
equivalents thereof. Specific, non-limiting examples of 
SARS-CoV proteins are provided in SEQ ID NOs: 2-12. 
Fusion proteins are also contemplated that include a heter- 
ologous amino acid sequence chemically linked to a SARS- 
CoV. polypeptide. Exemplary heterologous sequences 
include short amino acid sequence tags (such as six histidine 
residues), as well as a fusion of other proteins (such as c-myc 
or green fluorescent protein fusions). Epitopes of the SARS- 
CoV proteins, that are recognized by an antibody or that bind 
the major histocompatibility complex, and can be used to 
induce a SARS-CoV-specific immune response, are also 
encompassed by this disclosure. 

Methods for expressing large amounts of protein from a 
cloned gene introduced into E. coli may be utilized for the 
purification and functional analysis of proteins. For example, 
fusion proteins consisting of amino terminal peptides 
encoded by a portion of the E. coli lacZ or trpE gene linked to 
SARS-CoV proteins may be used to prepare polyclonal and 
monoclonal antibodies against these proteins. 

Intact native protein may also be produced in Æ. coli in 
large amounts for functional studies. Methods and plasmid 
vectors for producing fusion proteins and intact native pro- 
teins in bacteria are described by Sambrook et al. (ed.), 
Molecular Cloning: A Laboratory Manual, 274 ed., vol. 1-3, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
N.Y., 1989. Such fusion proteins may be made in large 
amounts, are easy to purify, and can be used to elicit antibody 
response. Native proteins can be produced in bacteria by 
placing a strong, regulated promoter and an efficient ribo- 
some-binding site upstream of the cloned gene. If low levels 
of protein are produced, additional steps may be taken to 
increase protein production; if high levels of protein are pro- 
duced, purification is relatively easy. Suitable methods are 
presented by Sambrook et al. (ed.), Molecular Cloning: A 
Laboratory Manual, 2” ed., vol. 1-3, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989, and are 
well known in the art. Often, proteins expressed at high levels 
are found in insoluble inclusion bodies. Methods for extract- 
ing proteins from these aggregates are described by Sam- 
brook et al. (ed.), Molecular Cloning: A Laboratory Manual, 
2" ed., vol. 1-3, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y., 1989. 

Isolation and purification of recombinantly expressed pro- 
teins may be carried out by conventional means including 
preparative chromatography and immunological separations. 
Additionally, the proteins can be chemically synthesized by 
any ofa number of manual or automated methods of synthesis 
known in the art. 


V. Specific Binding A gents 


The disclosure provides specific binding agents that bind to 
SARS-CoV polypeptides disclosed herein. The binding agent 
may be useful for purifying and detecting the polypeptides, as 
well as for detection and diagnosis of SARS-CoV. Examples 
of the binding agents are a polyclonal or monoclonal anti- 
body, and fragments thereof, that bind to any of the SARS- 
CoV polypeptides disclosed herein. 

Monoclonal or polyclonal antibodies may be raised to rec- 
ognize a SARS-CoV polypeptide described herein, or a frag- 
ment or variant thereof. Optimally, antibodies raised against 
these polypeptides would specifically detect the polypeptide 
with which the antibodies are generated. That is, antibodies 
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raised against a specific SARS-CoV polypeptide will recog- 
nize and bind that polypeptide, and will not substantially 
recognize or bind to other polvpeptides or antigens. The 
determination that an antibodv specificallv binds to a target 
polypeptide is made by any one of a number of standard 
immunoassay methods; for instance, the Western blotting 
technique (Sambrook et al. (ed.), Molecular Cloning: A 
Laboratory Manual, 2" ed., vol. 1-3, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989). 

Substantially pure SARS-CoV recombinant polypeptide 
antigens suitable for use as immunogen may be isolated from 
the transformed cells described above, using methods well 
known in the art. Monoclonal or polyclonal antibodies to the 
antigens may then be prepared. 

Monoclonal antibodies to the polypeptides can be prepared 
from murine hybridomas according to the classic method of 
Kohler & Milstein (Nature 256:495-97, 1975), or a derivative 
method thereof. Briefly, a mouse is repetitively inoculated 
with a few micrograms of the selected protein immunogen 
(for example, a polypeptide comprising at least one SARS- 
CoV-specific epitope, a portion of a polypeptide comprising 
at least one SARS-CoV-specific epitope, or a synthetic pep- 
tide comprising at least one SARS-CoV-specific epitope) 
over a period of a few weeks. The mouse is then sacrificed, 
and the antibody-producing cells of the spleen isolated. The 
spleen cells are fused by means of polyethylene glycol with 
mouse myeloma cells, and the excess unfused cells destroyed 
by growth of the system on selective media comprising ami- 
nopterin (HAT media). The successfully fused cells are 
diluted and aliquots of the dilution placed in wells ofa micro- 
titer plate where growth of the culture is continued. Antibody- 
producing clones are identified by detection of antibody in the 
supernatant fluid of the wells by immunoassay procedures, 
such as ELISA, as originally described by Engvall (Meth. 
Enzymol., 70:419-39, 1980), or a derivative method thereof. 
Selected positive clones can be expanded and their mono- 
clonal antibody product harvested for use. Detailed proce- 
dures for monoclonal antibody production are described in 
Harlow and Lane, Using Antibodies: A Laboratory Manual, 
CSHL, New York, 1999. 

Polyclonal antiserum containing antibodies can be pre- 
pared by immunizing suitable animals with a polypeptide 
comprising at least one SARS-CoV-specific epitope, a por- 
tion of a polypeptide comprising at least one SARS-CoV- 
specific epitope, or a synthetic peptide comprising at least one 
SARS-CoV-specific epitope, which can be unmodified or 
modified, to enhance immunogenicity. 

Effective antibody production (whether monoclonal or 
polyclonal) is affected by many factors related both to the 
antigen and the host species. For example, small molecules 
tend to be less immunogenic than others and may require the 
use of carriers and adjuvant. Also, host animals vary in 
response to site of inoculations and dose, with either inad- 
equate or excessive doses of antigen resulting in low titer 
antisera. Small doses (ng level) of antigen administered at 
multiple intradermal sites appear to be most reliable. An 
effective immunization protocol for rabbits can be found in 
Vaitukaitis et al. (J. Clin. Endocrinol. Metab., 33:988-91, 
1971). 

Booster injections can be given at regular intervals, and 
antiserum harvested when the antibody titer thereof, as deter- 
mined semi-quantitatively, for example, by double immun- 
odiffusion in agar against known concentrations of the anti- 
gen, begins to fall. See, for example, Ouchterlony et al., 
Handbook of Experimental Immunology, Wier, D. (ed.), 
Chapter 19, Blackwell, 1973. A plateau concentration of anti- 
body is usually in the range of 0.1 to 0.2 mg/ml of serum 
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(about 12 uM). Affinity of the antisera for the antigen is 
determined by preparing competitive binding curves, as 
described, for example, by Fisher (Manual of Clinical Immu- 
nology, Ch. 42, 1980). 

Antibody fragments may be used in place of whole anti- 
bodies and may be readily expressed in prokaryotic host cells. 
Methods of making and using immunologically effective por- 
tions of monoclonal antibodies, also referred to as “antibody 
fragments,” are well known and include those described in 
Better & Horowitz, Methods Enzymol. 178:476-96, 1989; 
Glockshuber et al., Biochemistry 29:1362-67, 1990; and U.S. 
Pat. Nos. 5,648,237 (Expression of Functional Antibody 
Fragments); 4,946,778 (Single Polypeptide Chain Binding 
Molecules); and 5,455,030 (Immunotherapy Using Single 
Chain Polypeptide Binding Molecules), and references cited 
therein. Conditions whereby a polypeptide/binding agent 
complex can form, as well as assays for the detection of the 
formation of a polypeptide/binding agent complex and quan- 
titation of binding affinities of the binding agent and polypep- 
tide, are standard in the art. Such assays can include, but are 
not limited to, Western blotting, immunoprecipitation, immu- 
nofluorescence, immunocytochemistry, immunohistochem- 
istry, fluorescence activated cell sorting (FACS), fluorescence 
in situ hybridization (FISH), immunomagnetic assays, 
ELISA, ELISPOT (Coligan et al, Current Protocols in 
Immunology, Wiley, NY, 1995), agglutination assays, floccu- 
lation assays, cell panning, and the like, as are well known to 
one of skill in the art. 

Binding agents of this disclosure can be bound to a sub- 
strate (for example, beads, tubes, slides, plates, nitrocellulose 
sheets, and the like) or conjugated with a detectable moiety, or 
both bound and conjugated. The detectable moieties contem- 
plated for the present disclosure can include, but are not 
limited to, an immunofluorescent moiety (for example, fluo- 
rescein, rhodamine), a radioactive moiety (for example, ??P, 
125], 35S), an enzyme moiety (for example, horseradish per- 
oxidase, alkaline phosphatase), a colloidal gold moiety, and a 
biotin moiety. Such conjugation techniques are standard in 
the art (for example, see Harlow and Lane, Using Antibodies: 
A Laboratory Manual, CSHL, New York, 1999; Yang et al., 
Nature, 382:319-24, 1996). 


VI. Detection and Diagnosis of SARS-CoV 


A. Nucleic Acid Based Methods of Detection and Diagnosis 

A major application of the SARS-CoV sequence informa- 
tion presented herein is in the area of detection and diagnostic 
testing for SARS-CoV infection. Methods for screening a 
subject to determine if the subject has been or is currently 
infected with SARS-CoV are disclosed herein. 

One such method includes providing a sample, which 
sample includes a nucleic acid such as DNA or RNA, and 
providing an assay for detecting in the sample the presence of 
a SARS-CoV nucleic acid molecule. Suitable samples 
include all biological samples useful for detection of viral 
infection in subjects, including, but not limited to, cells, tis- 
sues (for example, lung and kidney), bodily fluids (for 
example, blood, serum, urine, saliva, sputum, and cerebrospi- 
nal fluid), bone marrow aspirates, BAL, and oropharyngeal 
wash. Additional suitable samples include all environmental 
samples useful for detection of viral presence in the environ- 
ment, including, but not limited to, a sample obtained from 
inanimate objects or reservoirs within an indoor or outdoor 
environment. The detection in the sample of a SARS-CoV 
nucleic acid molecule may be performed by a number of 
methodologies, non-limiting examples of which are outlined 
below. 
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Inone embodiment, detecting in the sample the presence of 
a SARS-CoV nucleic acid molecule includes the amplifica- 
tion of a SARS-CoV nucleic acid sequence (or a fragment 
thereof). Any nucleic acid amplification method can be used. 
In one specific, non-limiting example, PCR is used to amplify 
the SARS-CoV nucleic acid sequence(s). In another non- 
limiting example, RT-PCR can be used to amplify the SARS- 
CoV nucleic acid sequences. In an additional non-limiting 
example, transcription-mediated amplification can be used to 
amplify the SARS-CoV nucleic acid sequences. 

In some embodiments, a pair of SARS-CoV-specific prim- 
ers are utilized in the amplification reaction. One or both of 
the primers can be end-labeled (for example, radiolabeled, 
fluoresceinated, or biotinylated). In one specific, non-limiting 
example, at least one of the primers is 5'-end labeled with the 
reporter dye 6-carboxyfluorescein (6-FAM). The pair of 
primers includes an upstream primer (which binds 5' to the 
downstream primer) and a downstream primer (which binds 
3' to the upstream primer). In one embodiment, either the 
upstream primer or the downstream primer is labeled. Spe- 
cific, non-limiting examples of SARS-CoV-specific primers 
include, but are not limited to: Cor-p-F2 (SEQ ID NO: 13), 
Cor-p-F3 (SEQ ID NO: 14), Cor-p-R1 (SEQ ID NO: 15), 
SARSI-F (SEQ ID NO: 16), SARS1-R (SEQ ID NO: 17), 
SARS2-F (SEQ ID NO: 19), SARS2-R (SEQ ID NO: 20), 
SARS3-F (SEQ ID NO: 22), SARS3R (SEQ ID NO: 23), 
N3-F (SEQ ID NO: 25), N3-R (SEQ ID NO: 26), 3'NTR-F 
(SEQ ID NO: 28), 3'NTR-R (SEQ ID NO: 29), M-F (SEQ ID 
NO: 31), and M-R (SEQ ID NO: 32). Additional primer pairs 
can be generated, for instance, to amplify any of the specific 
ORFs described herein, using well known primer design prin- 
ciples and methods. 

In one specific, non-limiting example, electrophoresis is 
used to detect amplified SARS-CoV-specific sequences. 
Electrophoresis can be automated using many methods well 
know in the art. In one embodiment, a genetic analyzer is 
used, such as an ABI 3100 Prism Genetic Analyzer (PE 
Applied Biosystems, Foster City, Calif.), wherein the bands 
are analyzed using GeneScan software (PE Applied Biosys- 
tems, Foster City, Calif.). 

In another specific, non-limiting example, hybridization 
assays are used to detect amplified SARS-CoV-specific 
sequences using distinguishing oligonucleotide probes. Such 
probes include “TaqMan” probes. TaqMan probes consist of 
an oligonucleotide with a reporter at the 5'-end and a quencher 
at the 3'-end. In one specific, non-limiting example, the 
reporter is 6-FAM and the quencher is Blackhole Quencher 
(Biosearch Tech., Inc., Novato, Calif.). When the probe is 
intact, the proximity of the reporter to the quencher results in 
suppression of reporter fluorescence, primarily by fluores- 
cence resonance energy transfer. If the target of interest is 
present, the TaqMan probe specifically hybridizes between 
the forward and reverse primer sites during the PCR anneal- 
ing step. In the process of PCR elongation, the 5'-3' nucle- 
olytic activity of the Taq DNA polymerase cleaves the hybrid- 
ized probe between the reporter and the quencher. The probe 
fragments are then displaced from the target, and polymer- 
ization of the strand continues. Taq DNA polymerase does not 
cleave non-hybridized probe, and cleaves the hybridized 
probe only during polymerization. The 3'-end of the probe is 
blocked to prevent extension of the probe during PCR. The 
5'-3' nuclease cleavage of the hybridized probe occurs in 
every cycle and does not interfere with the exponential accu- 
mulation of PCR product. Accumulation of PCR products is 
detected directly by monitoring the increase in fluorescence 
of the released reporter. The increase in fluorescence signal is 
detected only if the target sequence is complementary to the 
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probe and is amplified during PCR. Therefore, non-specific 
amplification is not detected. SARS-CoV-specific TaqMan 
probes of the present disclosure include, but are not limited to: 
SARSI-P (SEQ ID NO: 18), SARS2-P (SEQ ID NO: 21), 
SARS3-P (SEQ ID NO: 24), N3-P (SEQ ID NO: 27), 
3'NTR-P (SEQ ID NO: 30), and M-P (SEQ ID NO: 33), and 
hybridization assays include, but are not limited to, a real- 
time RT-PCR assay. 


B. Protein Based Methods of Detection and Diagnosis 

The present disclosure further provides methods of detect- 
ing a SARS-CoV antigen in a sample, and/or diagnosing 
SARS-CoV infection in a subject by detecting a SARS-CoV 
antigen. Examples of such methods comprise contacting the 
sample with a SARS-CoV-specific binding agent under con- 
ditions whereby an antigen/binding agent complex can form; 
and detecting formation of the complex, thereby detecting 
SARS-CoV antigen in a sample and/or diagnosing SARS- 
CoV infection in a subject. It is contemplated that at least 
certain antigens will be on an intact SARS-CoV virion, will 
be aSARS-CoV-encoded protein displayed on the surface of 
a SARS-CoV-infected cell expressing the antigen, or will be 
a fragment of the antigen. Contemplated samples subject to 
analysis by these methods can comprise any sample, such as 
a clinical sample, useful for detection of viral infection in a 
subject. 

Methods for detecting antigens in a sample are discussed, 
for example, in Ausubel et al. Short Protocols in Molecular 
Biology, 4” ed., John Wiley & Sons, Inc., 1999. Enzyme 
immunoassays such as IFA, ELISA and immunoblotting can 
be readily adapted to accomplish the detection of SARS-CoV 
antigens according to the methods of this disclosure. An 
ELISA method effective for the detection of soluble SARS- 
CoV antigens is the direct competitive ELISA. This method is 
most useful when a specific SARS-CoV antibody and puri- 
fied SARS-CoV antigen are available. Briefly: 1) coat a sub- 
strate (for example, a microtiter plate) with a sample sus- 
pected of containing a SARS-CoV antigen; 2) contact the 
bound SARS-CoV antigen with a SARS-CoV-specific anti- 
body bound to a detectable moiety (for example, horseradish 
peroxidase enzyme or alkaline phosphatase enzyme); 3) add 
purified inhibitor SARS-CoV antigen; 4) contact the above 
with the substrate for the enzyme; and 5) observe/measure 
inhibition of color change or fluorescence and quantitate anti- 
gen concentration (for example, using a microtiter plate 
reader). 

An additional ELISA method effective for the detection of 
soluble SARS-CoV antigens is the antibody-sandwich 
ELISA. This method is frequently more sensitive in detecting 
antigen than the direct competitive ELISA method. Briefly: 
1) coat a substrate (for example, a microtiter plate) with a 
SARS-CoV-specific antibody; 2) contact the bound SARS- 
CoV antibody with a sample suspected of containing a 
SARS-CoV antigen; 3) contact the above with SARS-CoV- 
specific antibody bound to a detectable moiety (for example, 
horseradish peroxidase enzyme or alkaline phosphatase 
enzyme); 4) contact the above with the substrate for the 
enzyme; and 5) observe/measure color change or fluores- 
cence and quantitate antigen concentration (for example, 
using a microtiter plate reader). 

An ELISA method effective for the detection of cell-sur- 
face SARS-CoV antigens is the direct cellular ELISA. 
Briefly, cells suspected of exhibiting a cell-surface SARS- 
CoV antigen are fixed (for example, using glutaraldehyde) 
and incubated with a SARS-CoV-specific antibody bound to 
a detectable moiety (for example, horseradish peroxidase 
enzyme or alkaline phosphatase enzyme). Following a wash 
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to remove unbound antibody, substrate for the enzyme is 
added and color change or fluorescence is observed/mea- 
sured. 

The present disclosure further provides methods of detect- 
ing a SARS-CoV-reactive antibody in a sample, and/or diag- 
nosing SARS-CoV infection in a subject by detecting a 
SARS-CoV-reactive antibody. Examples of such methods 
comprise contacting the sample with a SARS-CoV polypep- 
tide of this disclosure under conditions whereby a polypep- 
tide/antibody complex can form; and detecting formation of 
the complex, thereby detecting SARS-CoV antibody in a 
sample and/or diagnosing SARS-CoV infection in a subject. 
Contemplated samples subject to analysis by these methods 
can comprise any sample, such as a clinical sample, as 
described herein as being useful for detection of viral infec- 
tion in a subject. 

Methods for detecting antibodies in a sample are discussed, 
for example, in Ausubel et al. Short Protocols in Molecular 
Biology, 4" ed., John Wiley & Sons, Inc., 1999. Enzyme 
immunoassays such as IFA, ELISA and immunoblotting can 
be readily adapted to accomplish the detection of SARS-CoV 
antibodies according to the methods of this disclosure. An 
ELISA method effective for the detection of specific SARS- 
CoV antibodies is the indirect ELISA method. Briefly: 1) 
bind a SARS-CoV polypeptide to a substrate (for example, a 
microtiter plate); 2) contact the bound polypeptide with a 
sample suspected of containing SARS-CoV antibody; 3) con- 
tact the above with a secondary antibody bound to a detect- 
able moiety which is reactive with the bound antibody (for 
example, horseradish peroxidase enzyme or alkaline phos- 
phatase enzyme); 4) contact the above with the substrate for 
the enzyme; and 5) observe/measure color change or fluores- 
cence. 

Another immunologic technique that can be useful in the 
detection of SARS-CoV antibodies uses monoclonal anti- 
bodies for detection of antibodies specifically reactive with 
SARS-CoV polypeptides in a competitive inhibition assay. 
Briefly, a sample suspected of containing SARS-CoV anti- 
bodies is contacted with a SARS-CoV polypeptide of this 
disclosure which is bound to a substrate (for example, a 
microtiter plate). Excess sample is thoroughly washed away. 
A labeled (for example, enzyme-linked, fluorescent, radioac- 
tive, and the like) monoclonal antibody specific for the 
SARS-CoV polypeptide is then contacted with any previ- 
ously formed polypeptide-antibody complexes and the 
amount of monoclonal antibody binding is measured. The 
amount of inhibition of monoclonal antibody binding is mea- 
sured relativeto a control (no monoclonal antibody), allowing 
for detection and measurement ofantibody in the sample. The 
degree of monoclonal antibody inhibition can be a very spe- 
cific assay for detecting SARS-CoV. Monoclonal antibodies 
can also be used for direct detection of SARS-CoV in cells or 
tissue samples by, for example, IFA analysis according to 
standard methods. 

As a further example, a micro-agglutination test can be 
used to detect the presence of SARS-CoV antibodies in a 
sample. Briefly, latex beads, red blood cells or other agglu- 
tinable particles are coated with a SARS-CoV polypeptide of 
this disclosure and mixed with a sample, such that antibodies 
in the sample that are specifically reactive with the antigen 
crosslink with the antigen, causing agglutination. The agglu- 
tinated polypeptide-antibody complexes form a precipitate, 
visible with the naked eye or measurable by spectrophotom- 
eter. In a modification of the above test, SARS-CoV-specific 
antibodies of this disclosure can be bound to the agglutinable 
particles and SARS-CoV antigen in the sample thereby 
detected. 
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VII. Pharmaceutical and Immune Stimulatory 
Compositions and Uses Thereof 


Pharmaceutical compositions including SARS-CoV 
nucleic acid sequences, SARS-CoV polypeptides, or anti- 
bodies that bind these polypeptides, are also encompassed by 
the present disclosure. These pharmaceutical compositions 
include a therapeutically effective amount of one or more 
SARS-CoV polypeptides, one or more nucleic acid mol- 
ecules encoding a SARS-CoV polypeptide, or an antibody 
that binds a SARS-CoV polypeptide, in conjunction with a 
pharmaceutically acceptable carrier. 

Disclosed herein are substances suitable for use as immune 
stimulatory compositions for the inhibition or treatment of 
SARS. Particular immune stimulatory compositions are 
directed against SARS-CoV, and include antigens obtained 
from SARS-CoV. In one embodiment, an immune stimula- 
tory composition contains attenuated SARS-CoV. Methods 
of viral attenuation are well known in the art, and include, but 
are not limited to, high serial passage (for example, in sus- 
ceptible host cells under specific environmental conditions to 
select for attenuated virions), exposure to a mutagenic agent 
(for example, a chemical mutagen or radiation), genetic engi- 
neering using recombinant DNA technology (for example, 
using gene replacement or gene knockout to disable one or 
more viral genes), or some combination thereof. 

In another embodiment, the immune stimulatory compo- 
sition contains inactivated SARS-CoV. Methods of viral inac- 
tivation are well known in the art, and include, but are not 
limited to, heat and chemicals (for example, formalin, fl-pro- 
piolactone, and ethylenimines). 

In yet another embodiment, the immune stimulatory com- 
position contains a nucleic acid vector that includes SARS- 
CoV nucleic acid molecules described herein, or that includes 
a nucleic acid sequence encoding an immunogenic polypep- 
tide or polypeptide fragment of SARS-CoV or derived from 
SARS-CoV, such as a polypeptide that encodes a surface 
protein of SARS-CoV. 

In a further embodiment, the immune stimulatory compo- 
sition contains a SARS-CoV subunit, such as glycoprotein, 
major capsid protein, or other gene products found to elicit 
humoral and/or cell mediated immune responses. 

The provided immune stimulatory SARS-CoV polypep- 
tides, constructs or vectors encoding such polypeptides, are 
combined with a pharmaceutically acceptable carrier or 
vehicle for administration as an immune stimulatory compo- 
sition to human or animal subjects. In some embodiments, 
more than one immune stimulatory SARS-CoV polypeptide 
may be combined to form a single preparation. 

The immunogenic formulations may be conveniently pre- 
sented in unit dosage form and prepared using conventional 
pharmaceutical techniques. Such techniques include the step 
of bringing into association the active ingredient and the 
pharmaceutical carrier(s) or excipient(s). In general, the for- 
mulations are prepared by uniformly and intimately bringing 
into association the active ingredient with liquid carriers. 
Formulations suitable for parenteral administration include 
aqueous and non-aqueous sterile injection solutions which 
may contain anti-oxidants, buffers, bacteriostats and solutes 
which render the formulation isotonic with the blood of the 
intended recipient; and aqueous and non-aqueous sterile sus- 
pensions which may include suspending agents and thicken- 
ing agents. The formulations may be presented in unit-dose or 
multi-dose containers, for example, sealed ampules and vials, 
and may be stored in a freeze-dried (lyophilized) condition 
requiring only the addition of a sterile liquid carrier, for 
example, water for injections, immediately prior to use. 
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Extemporaneous injection solutions and suspensions mav be 
prepared from sterile powders, granules and tablets com- 
monly used by one of ordinary skill in the art. 

In certain embodiments, unit dosage formulations are 
those containing a dose or unit, or an appropriate fraction 
thereof, of the administered ingredient. It should be under- 
stood that in addition to the ingredients particularly men- 
tioned above, formulations encompassed herein mav include 
other agents commoniv used bv one of ordinarv skill in the 
art. 

The compositions provided herein, including those for use 
as immune stimulatorv compositions, mav be administered 
through different routes, such as oral, including buccal and 
sublingual, rectal, parenteral, aerosol, nasal, intramuscular, 
subcutaneous, intradermal, and topical. Thev mav be admin- 
istered in different forms, including but not limited to solu- 
tions, emulsions and suspensions, microspheres, particles, 
microparticles, nanoparticles, and liposomes. 

The volume of administration will varv depending on the 
route of administration. Bv wav of example, intramuscular 
injections may range from about 0.1 ml to about 1.0 ml. Those 
of ordinary skill in the art will know appropriate volumes for 
different routes of administration. 

A relatively recent development in the field of immune 
stimulatorv compounds (for example, vaccines) is the direct 
injection of nucleic acid molecules encoding peptide antigens 
(broadly described in Janeway & Travers, Immunobiology: 
The Immune System In Health and Disease, page 13.25, Gar- 
land Publishing, Inc., New York, 1997; and McDonnell & 
Askari, N. Engl. J. Med. 334:42-45, 1996). Vectors that 
include nucleic acid molecules described herein, or that 
include a nucleic acid sequence encoding an immunogenic 
SARS-CoV polypeptide may be utilized in such DNA vacci- 
nation methods. 


Thus, the term “immune stimulatory composition” as used 
herein also includes nucleic acid vaccines in which a nucleic 
acid molecule encoding a SARS-CoV polypeptide is admin- 
istered to a subject in a pharmaceutical composition. For 
genetic immunization, suitable delivery methods known to 
those skilled in the art include direct injection of plasmid 
DNA into muscles (Wolff et al., Hum. Mol. Genet. 1:363, 
1992), delivery of DNA complexed with specific protein car- 
riers (Wu et al., J. Biol. Chem. 264:16985, 1989), co-precipi- 
tation of DNA with calcium phosphate (Benvenisty and 
Reshef, Proc. Natl. Acad. Sci. 83:9551, 1986), encapsulation 
of DNA in liposomes (Kaneda et al., Science 243:375, 1989), 
particle bombardment (Tang et al., Nature 356:152, 1992; 
Eisenbraun et al., DNA Cell Biol. 12:791, 1993), and in vivo 
infection using cloned retroviral vectors (Seeger et al., Proc. 
Natl. Acad. Sci. 81:5849, 1984). Similarly, nucleic acid vac- 
cine preparations can be administered via viral carrier. 


The amount of immunostimulatory compound in each dose 
of an immune stimulatory composition is selected as an 
amount that induces an immunostimulatory or immunopro- 
tective response without significant, adverse side effects. 
Such amount will vary depending upon which specific immu- 
nogen is employed and how it is presented. Initial injections 
may range from about 1 ug to about 1 mg, with some embodi- 
ments having a range of about 10 ug to about 800 ug, and still 
other embodiments a range of from about 25 ug to about 500 
ug. Following an initial administration of the immune stimu- 
latory composition, subjects may receive one or several 
booster administrations, adequately spaced. Booster admin- 
istrations may range from about 1 ug to about 1 mg, with other 
embodiments having a range of about 10 ug to about 750 ug, 
and still others a range of about 50 ug to about 500 ug. 
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Periodic boosters at intervals of 1-5 years, for instance three 
years, may be desirable to maintain the desired levels of 
protective immunity. 


It is also contemplated that the provided immunostimula- 
tory molecules and compositions can be administered to a 
subject indirectly, by first stimulating a cell in vitro, which 
stimulated cell is thereafter administered to the subject to 
elicit an immune response. Additionally, the pharmaceutical 
or immune stimulatory compositions or methods of treatment 
may be administered in combination with other therapeutic 
treatments. 


VIII. Kits 


Also provided herein are kits useful in the detection and/or 
diagnosis of SARS-CoV. This includes kits for use with 
nucleic acid and protein detection methods, such as those 
disclosed herein. 


The SARS-CoV-specific oligonucleotide primers and 
probes described herein can be supplied in the form ofa kit for 
use in detection of SARS-CoV. In such a kit, an appropriate 
amount of one or more of the oligonucleotides is provided in 
one or more containers, or held ona substrate. An oligonucle- 
otide primer or probe can be provided in an aqueous solution 
or as a freeze-dried or lyophilized powder, for instance. The 
container(s) in which the oligonucleotide(s) are supplied can 
be any conventional container that is capable of holding the 
supplied form, for instance, microfuge tubes, ampoules, or 
bottles. In some applications, pairs of primers are provided in 
pre-measured single use amounts in individual (typically dis- 
posable) tubes or equivalent containers. With such an 
arrangement, the sample to be tested for the presence of a 
SARS-CoV nucleic acid can be added to the individual tubes 
and amplification carried out directly. 


The amount of each oligonucleotide supplied in the kit can 
be any appropriate amount, and can depend on the market to 
which the product is directed. For instance, if the kit is 
adapted for research or clinical use, the amount of each oli- 
gonucleotide primer provided would likely be an amount 
sufficient to prime several PCR amplification reactions. Gen- 
eral guidelines for determining appropriate amounts can be 
found, for example, in Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y., 2001; Ausubel et al. (eds.), Short 
Protocols in Molecular Biology, John Wiley and Sons, New 
York, N.Y., 1999; and Innis et al., PCR Applications, Proto- 
cols for Functional Genomics, Academic Press, Inc., San 
Diego, Calif., 1999. A kit can include more than two primers, 
in order to facilitate the amplification of a larger number of 
SARS-CoV nucleotide sequences. 


In some embodiments, kits also include one or more 
reagents necessary to carry out in vitro amplification reac- 
tions, including DNA sample preparation reagents, appropri- 
ate buffers (for example, polymerase buffer), salts (for 
example, magnesium chloride), and deoxyribonucleotides 
(ANTP5). 

Kits can include either labeled or unlabeled oligonucle- 
otide primers and/or probes for use in detection of SARS- 
CoV nucleotide sequences. The appropriate sequences for 
such a probe will be any sequence that falls between the 
annealing sites of the two provided oligonucleotide primers, 
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such that the sequence that the probe is complementary to is 
amplified during the amplification reaction. 


One or more control sequences for use in the amplification 
reactions also can be supplied in the kit. In other particular 
embodiments, the kit includes equipment, reagents, and 
instructions for extracting and/or purifying nucleotides from 
a sample. 


Kits for the detection of SARS-CoV antigen include for 
instance at least one SARS-CoV antigen-specific binding 
agent (for example, a polyclonal or monoclonal antibody or 
antibody fragment). The kits may also include means for 
detecting antigen:specific binding agent complexes, for 
instance the specific binding agent may be detectably labeled. 
If the specific binding agent is not labeled, it may be detected 
by second antibodies or protein A, for example, which may 
also be provided in some kits in one or more separate con- 
tainers. Such techniques are well known. 


Another example of an assay kit provided herein is a 
recombinant SARS-CoV-specific polypeptide (or fragment 
thereof) as an antigen and an enzyme-conjugated anti-human 
antibody as a second antibody. Examples of such kits also can 
include one or more enzymatic substrates. Such kits can be 
used to test if a sample from a subject contains antibodies 
against a SARS-CoV-specific protein. 

The subject matter ofthe present disclosure is further illus- 
trated by the following non-limiting Examples. 


EXAMPLES 
Example 1 


Isolation and Characterization of SARS-CoV 


Virus Isolation and Ultrastructural Characterization 


This example describes the original isolation and charac- 
terization of a new human coronavirus from patients with 
SARS. 


A variety of clinical specimens (blood, serum, material 
from oropharyngeal swabs or washings, material from 
nasopharyngeal swabs, and tissues of major organs collected 
atautopsy) from patients meeting the case definitionofSARS 
were sent to the Centers for Disease Control and Prevention 
(CDC) as part of the etiologic investigation of SARS. These 
samples were inoculated onto a number of continuous cell 
lines, including Vero E6, NCI-H292, MDCK, LLC-MK2, and 
B95-8 cells, and into suckling ICR mice by the intracranial 
and intraperitoneal routes. All cultures were observed daily 
for CPE. Maintenance medium was replenished at day seven, 
and cultures were terminated fourteen days after inoculation. 
Any cultures exhibiting identifiable CPE were subjected to 
several procedures to identify the cause ofthe effect. Suckling 
mice were observed daily for fourteen days, and any sick or 
dead mice were further tested by preparing a brain suspension 
that was filtered and subcultured. Mice that remained well 
after fourteen days were killed, and their test results were 
recorded as negative. 

Two cell lines, Vero E6 cells and NCI-H292 cells, inocu- 
lated with oropharyngeal specimens from Patient 16 (a 46 
year old male physician with an epidemiologic link to a 
hospital with multiple SARS patients) initially showed CPE 
(Table 1) 
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Specimens from patients with SARS that were positive for SARS-CoV by one or more methods*. 


























29 
TABLE 1 
Exposure Findings 
Patient and on Chest Hospital- Serologic 
No Setting Age/Sex Radiograph ization Results 
1 Singapore, 53 yr/F Pneumonia Yes + 
hospital 
27 Hong Kong, 36 yr/F Pneumonia Wes + 
hotel 
3 Hong Kong, 22 yr/M Pneumonia Yes + 
hotel 
47 Hong Kong, 39 yr/M Pneumonia Yes + 
hotel 
5 Hong Kong, 49YrM Pneumonia Yes Not 
hotel done 
6i Hong Kong, 46 yr/M Pneumonia Yes + 
hotel 
7 Vietnam, Adult/ Pneumonia Yes — 
hospita unknown 
8 Vietnam, Adult/ Pneumonia Yes — 
hospita unknown 
9 Vietnam, Adult/ Pneumonia Yes — 
hospita unknown 
10 Vietnam, Adult/ Pneumonia Yes — 
hospita. unknown 
11 Vietnam, Adult/ Pneumonia Yes — 
hospital unknown 
12 Vietnam, Adult/ Pneumonia Yes — 
hospital unknown 
13 Vietnam, Adult/ Pneumonia Yes - 
hospital unknown 
14 Vietnam, Adult/ Pneumonia Yes - 
hospital unknown 
15 Vietnam, Adult/ Pneumonia Yes - 
hospital unknown 
16 Vietnam, 46 yrM Pneumonia Yes + 
hospita. 
17 Canada, 43 yr/M Pneumonia Yes Not 
family done 
18 Taiwan, 51 yr/F Pneumonia Yes - 
family 
19 Hong Kong,  Adul/F Pneumonia Yes * 
hotel 


Specimen Isolation RT-PCR 

Nasal, oropharyngeal — Not 

swabs done 

Nasal, swab — Not 

done 
Swab — — 
Nasal, pharyngeal swab — — 
Sputum + + 
Kidney, lung, broncho- +$ + 

alveolar lavage 
Oropharyngeal wash + + 
Oropharyngeal wash — + 
Oropharyngeal wash - + 
Oropharyngeal wash - + 
Oropharyngeal wash - + 
Oropharyngeal wash - + 
Oropharyngeal wash + + 
Oropharyngeal wash - + 
Oropharyngeal wash - + 
Nasal, oropharyngeal +4 + 
swabs 
Lung, bone marrow — 

Sputum - + 
Oropharyngeal wash - + 





*Plus signs denote positive results, and minus signs negative results. The serologic and RT-PCR assays were not necessarily performed on samples obtained 


at the same time. 
+This was a late specimen, antibody positive at first sample. 


{Travel included China, Hong Kong (hotel), and Hanoi (the patient was the index patient in the French Hospital). 


§ Isolation was from the kidney only. 
{Isolation was from the oropharyngeal only. 


The CPE in the Vero E6 cells was first noted on the fifth day 
post-inoculation; it was focal, with cell rounding and a refrac- 
tive appearance in the affected cells that was soon followed by 
cell detachment (FIG. 1A). The CPE spread quickly to 
involve the entire cell monolayer within 24 to 48 hours. Sub- 
culture of material after preparation of a master seed stock 
(used for subsequent antigen production) resulted in the rapid 
appearance of CPE, as noted above, and in complete destruc- 
tionofthe monolayer in the inoculated flasks within 48 hours. 
Similar CPE was also noted in four additional cultures: three 
cultures of respiratory specimens (two oropharyngeal washes 
and one sputum specimen) and one culture of a suspension of 
kidney tissue obtained at autopsy. In these specimens, the 
initial CPE was observed between day two and day four and, 
as noted above, the CPE rapidly progressed to involve the 
entire cell monolayer. 

Tissue culture samples showing CPE were prepared for 
electron-microscopical examination. Negative-stain elec- 
tron-microscopical specimens were prepared by drying cul- 
ture supernatant, mixed 1:1 with 2.5% paraformaldehyde, 
onto Formvarcarbon-coated grids and staining with 2% 
methylamine tungstate. Thin-section electron-microscopical 


45 


specimens were prepared by fixing a washed cell pellet with 
2.596 glutaraldehyde and embedding the cell pellet in epoxy 
resin. In addition, a master seed stock was prepared from the 
remaining culture supernatant and cells by freeze-thawing the 


so culture flask, clarifying the thawed contents by centrifugation 


at 1000xg, and dispensing the supernatant into aliquots stored 
in gas phase over liquid nitrogen. The master seed stock was 
subcultured into 850-cm? roller bottles of Vero E6 cells for the 
preparation of formalin-fixed positive control cells for immu- 


ss nohistochemical analysis, mixed with normal Vero E6 cells, 


and gamma-irradiated for preparation of spot slides for IFA 
tests or extracted with detergent and gamma-irradiated for use 
as an ELISA antigen for antibody tests. 


Examination of CPE-positive Vero E6 cells by thin-section 


60 electron microscopy revealed characteristic coronavirus par- 


ticles within the cisternae ofthe rough endoplasmic reticulum 
and in vesicles (FIG. 2A) (Becker et al., J. Virol. 1:1019-27, 
1967; Oshiro et al. J. Gen. Virol. 12:161-8, 1971). Extracel- 
lular particles were found in large clusters and adhering to the 


65 surface of the plasma membrane. Negative-stain electron 


microscopy identified coronavirus particles, 80 to 140 nm in 
diameter, with 20- to 40-nm complex surface projections 
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surrounding the periphery (FIG. 2B). Hemagglutinin 
esterase-type glycoprotein projections were not seen. 

The isolation and growth of a human-derived coronavirus 
in Vero E6 cells were unexpected. The previously known 
human coronaviruses are notably fastidious, preferring select 
cell lines, organ culture, or suckling mice for propagation. 
The only human or animal coronavirus which has been shown 
to grow in Vero E6 cells is PEDV, and it requires the addition 
of trypsin to culture medium for growth in the cells. More- 
over, PEDV adapted to growth in Vero E6 cells results in a 
strikingly different CPE, with cytoplasmic vacuoles and the 
formation of large syncytia. Syncytial cells were only 
observed occasionally in monolayers of Vero E6 cells 
infected with the SARS-CoV; they clearly do not represent 
the dominant CPE. 


Reverse Transcription-Polymerase Chain Reaction and 
Sequencing 

For RT-PCR assays, cell-culture supernatants were placed 
in lysis buffer. RNA extracts were prepared from 100 ul of 
each specimen (or culture supernatant) with the automated 
NucliSens extraction system (bioMerieux, Durham, N.C.). 
Initially, degenerate, inosine-containing primers IN-2 (+) 
S'GGGTTGGGACTA TCCTAAGTGTGAS' (SEQ ID NO: 
34) and IN-A(—) S'TAACACACAACICCATCA TCAS' (SEQ 
ID NO: 35) were designed to anneal to sites encoding con- 
served amino acid motifs that were identified on the basis of 
alignments of available coronavirus ORF la, ORF 1b, S, HE, 
M, and N gene sequences. Additional, SARS-specific, prim- 
ers Cor-p-F2 (+) S'CTAACATGCTTAGGATAATGG3' (SEQ 
ID NO: 13), Cor-p-F3 (+) S'GCCTCTCTTGTTCT- 
TGCTCGC3' (SEQ ID NO: 14), and Cor-p-R1 (-) 5' CAG- 
GTAAGCGTAAAACTCATC3' (SEQ ID NO: 15) were 
designed as sequences were generated from RT-PCR prod- 
ucts amplified with the degenerate primers. These SARS- 
specific primers were used to test patient specimens for SARS 
(see below). Primers used for specific amplification of human 
metapneumovirus have been described by Falsey et al. (J. 
Infect. Dis. 87:785-90, 2003). 

For RT-PCR products of less than 3 kb, cDNA was synthe- 
sized in a 20 ul reaction mixture containing 500 ng of RNA, 
200 U of Superscript™ II reverse transcriptase (Invitrogen 
Life Technologies, Carlsbad, Calif.), 40 U of RNasin 
(Promega Corp., Madison, Wis.), 100 mM each dNTP (Roche 
Molecular Biochemicals, Indianapolis, Ind.), 4 ul of 5x reac- 
tion buffer (Invitrogen Life Technologies, Carlsbad, Calif.), 
and 200 pmol of the reverse primer. The reaction mixture, 
except for the reverse transcriptase, was heated to 70? C. for 
2 minutes, cooled to 4? C. for 5 minutes and then heated to 42? 
C. in a thermocycler. The mixture was held at 42? C. for 4 
minutes, and then the reverse transcriptase was added, and the 
reactions were incubated at 42? C. for 45 minutes. Two micro- 
liters of the cDNA reaction was used in a 50 ul PCR reaction 
containing 67 mM Tris-HCl (pH 8.8), 1 mM each primer, 17 
mM ammonium sulfate, 6 mM EDTA, 2 mM MgCl,, 200 mM 
each dNTP, and 2.5 U of Taq DNA polymerase (Roche 
Molecular Biochemicals, Indianapolis, Ind.). The thermocy- 
cler program for the PCR consisted of 40 cycles of denatur- 
ation at 95? C. for 30 seconds, annealing at 42? C. for 30 
seconds, and extension at 65? C. for 30 seconds. For SARS- 
CoV-specific primers, the annealing temperature was 
increased to 55? C. 

For amplification of fragments longer than 3 kb, regions of 
the genome between sections of known sequence were ampli- 
fied by means of a long RT-PCR protocol and SARS-CoV- 
specific primers. First-strand cDNA synthesis was performed 
at 42? C. or 50? C. using Superscript™ II RNase H reverse 
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transcriptase (Invitrogen Life Technologies, Carlsbad, Calif.) 
according to the manufacturer's instructions with minor 
modifications. Coronavirus-specific primers (500 ng) and 
SARS-CoV RNA (350 ng) were combined with the PCR 
Nucleotide Mix (Roche Molecular Biochemicals, Indianapo- 
lis, Ind.), heated for 1 minute at 94? C., and cooled to 4? C. in 
a thermocycler. The 5x first-strand buffer, dithiothreitol (In- 
vitrogen Life Technologies, Carlsbad, Calif.), and Protector 
RNase Inhibitor (Roche Molecular Biochemicals, Indianapo- 
lis, Ind.) were added, and the samples were incubated at 42? 
C. or 50? C. for 2 minutes. After reverse transcriptase (200 U) 
was added, the samples were incubated at 42? C. or 50? C. for 
1.5 to 2 hours. Samples were inactivated at 70? C. for 15 
minutes and subsequently treated with 2 U of RNase H 
(Roche Molecular Biochemicals, Indianapolis, Ind.) at 37? C. 
for 30 minutes. Long RT-PCR amplification of 5- to 8-kb 
fragments was performed using Taq Plus Precision (Strat- 
agene, La Jolla, Calif.) and AmpliWax PCR Gem 100 beads 
(Applied Biosystems; Foster City, Calif.) for *hot start" PCR 
with the following thermocycling parameters: denaturation at 
94? C. for 1 minute followed by 35 cycles of 94? C. for 30 
seconds, 55? C. for 30 seconds, an increase of 0.4 degrees per 
second up to 72? C., and 72°C. for 7 to 10 minutes, with a final 
extension at 72? C. for 10 minutes. RT-PCR products were 
separated by electrophoresis on 0.996 agarose TAE gels and 
purified by use of a QI Aquick Gel Extraction Kit (Qiagen, 
Inc., Santa Clarita, Calif.). 

In all cases, the RT-PCR products were gel-isolated and 
purified for sequencing by means of a QIAquick Gel Extrac- 
tion kit (Qiagen, Inc., Santa Clarita, Calif.). Both strands were 
sequenced by automated methods, using fluorescent dideoxy- 
chain terminators (Applied Biosystems; Foster City, Calif.). 

The sequence of the leader was obtained from the subge- 
nomic mRNA coding for the N gene and from the 5' terminus 
of genomic RNA. The 5' rapid amplification of cDNA ends 
(RACE) technique (Harcourt et al., Virology 271:334-49, 
2000) was used with reverse primers specific forthe N geneor 
for the 5' untranslated region. RACE products were either 
sequenced directly or were cloned into a plasmid vector 
before sequencing. A primer that was specific for the leader of 
SARS-CoV was used to amplify the region between the 
5'-terminus of the genome and known sequences in the rep 
gene. The 3'-terminus of the genome was amplified for 
sequencing by use of an oligo-(dT) primer and primers spe- 
cific for the N gene. 

Once the complete SARS-CoV genomic sequence had 
been determined, it was confirmed by sequencing a series of 
independently amplified RI-PCR products spanning the 
entire genome. Positive- and negative-sense sequencing 
primers, at intervals of approximately 300 nt, were used to 
generate a confirmatory sequence with an average redun- 
dancy of 9.1. The confirmatory sequence was identical to the 
original sequence. The genomic sequence (SEQ ID NO: 1) 
was published in the GenBank sequence database (Accession 
No. AY278741) on Apr. 21, 2003. 


Sequence Analysis 

Predicted amino acid sequences were compared with those 
from reference viruses representing each species for which 
complete genomic sequence information was available: 
group 1 representatives included human coronavirus 229E 
(GenBank Accession No. AF304460), porcine epidemic diar- 
rhea virus (GenBank Accession No. AF353511), and trans- 
missible gastroenteritis virus (GenBank Accession No. 
AF271965); group 2 representatives included bovine coro- 
navirus (GenBank Accession No. AF220295) and mouse 
hepatitis virus (GenBank Accession No. AF201929); group 3 
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was represented by infectious bronchitis virus (GenBank 
Accession No. M95169). Sequences for representative 
strains of other coronavirus species for which partial 
sequence information was available were included for some 
of the structural protein comparisons: group 1 representative 5 
strains included canine coronavirus (GenBank Accession No. 
D13096), feline coronavirus (GenBank Accession No. 
AY204704), and porcine respiratory coronavirus (GenBank 
Accession No. Z24675); and group 2 representatives 
included three strains of human coronavirus OC43 (GenBank 10 
Accession Nos. M76373, L14643 and M93390), porcine 
hemagglutinating encephalomyelitis virus (GenBank Acces- 
sion No. AY078417), and rat coronavirus (GenBank Acces- 
sion No. AF207551). 

Partial nucleotide sequences of the polymerase gene were 15 
aligned with published coronavirus sequences, using 
CLUSTAL W for Unix (version 1.7; Thompson et al., Nucleic 
Acids Res. 22:4673-80, 1994). Phylogenetic trees were com- 
puted by maximum parsimony, distance, and maximum like- 
lihood-based criteria analysis with PAUP (version 4.0.d10; 20 
Swofford ed., Phylogenetic Analysis using Parsimony and 
other Methods, Sinauer Associates, Sunderland, Mass.). 
When compared with other human and animal coronaviruses, 
the nucleotide and deduced amino acid sequence from this 
region had similarity scores ranging from 0.56 to 0.63 and 25 
from 0.57 to 0.74, respectively. The highest sequence simi- 
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Gonnet protein comparison matrix. The resulting trees were 
adjusted for final output by using treetool version 2.0.1. 
Uncorrected pairwise distances were calculated from the 
aligned sequences by using the Distances program from the 
Wisconsin Sequence Analysis Package, version 10.2 (Accel- 
rys, Burlington, Mass.). Distances were converted to percent 
identity by subtracting from 100. The amino acid sequences 
for three well-defined enzymatic proteins encoded by the rep 
gene and the four major structural proteins of SARS-CoV 
were compared with those from representative viruses for 
each of the species of coronavirus for which complete 
genomic sequence information was available (FIG. 4, Table 
2). The topologies of the resulting phylograms are remark- 
ably similar (FIG. 4). For each protein analyzed, the species 
formed monophyletic clusters consistent with the established 
taxonomic groups. In all cases, SARS-CoV sequences segre- 
gated into a fourth, well-resolved branch. These clusters were 
supported by bootstrap values above 90% (1000 replicates). 
Consistent with pairwise comparisons between the previ- 
ously characterized coronavirus species (Table 2), there was 
greater sequence conservation in the enzymatic proteins 
(3CI”'”, polymerase (POL), and helicase (HEL)) than among 
the structural proteins (S, E, M, and N). These results indicate 
that SARS-CoV is not closely related to any of the previously 
characterized coronaviruses and forms a distinct group within 
the genus Coronavirus. 


TABLE 2 


——.................... 
Pairwise amino acid identities of coronavirus proteins. 


Group Virus 


Gl HCoV-229E 
PEDV 
TGEV 

G2 BCoV 
MHV 

G3 IBV 


SARS-CoV 
CoV Range 


larity was obtained with group II coronaviruses. The maxi- B 


mum-parsimony tree obtained from the nucleotide-seguence 
alignment is shown in FIG. 3. Bootstrap analvses of the 
internal nodes at the internal branches of the tree provided 
strong evidence that the SARS-CoV is genetically distinct 


; 50 
from other known coronaviruses. 


Microarray analyses (using a long oligonucleotide DNA 
microarray with array elements derived from highly con- 
served regions within viral families) of samples from infected 
and uninfected cell cultures gave a positive signal for a group 55 
of eight oligonucleotides derived from two virus families: 
Coronaviridae and Astroviridae (Wang et al, PNAS 
99:15687-92, 2002). All of the astroviruses and two of the 
coronavirus oligonucleotides share a consensus sequence 
motif that maps to the extreme 3'-end of astroviruses and two go 
members ofthe coronavirus family: avian infectious bronchi- 
tis and turkey coronavirus (Jonassen et al., J. Gen. Virol. 
79:715-8, 1998). Results were consistent with the identity of 
the isolate as a coronavirus. 

Additional sequence alignments and neighbor-joining 65 


trees were generated by using ClustalX (Thompson et al., 
Nucleic Acids Res. 25:4876-82, 1997), version 1.83, with the 


3CLPRO POL HEL S E M N 


Pairwise Amino Acid Identity (Percent) 


40.1 58.8 59.7 23.9 22.7 28.8 23.0 
444 59.5 61.7 21:1 17.6 31.8 22.6 
44.0 59.4 61.2 20.6 22.4 30.0 25.6 
48.8 66.3 68.3 27.1 20.0 39.7 31.9 
49.2 66.5 67.3 26.5 21.1 39.0 33.0 
41.3 62.5 58.6 21.8 18.4 27.2 24.0 


306 932 601 1255 76 221 422 
302-307 923-940 506-600 1173-1452 76-108 225-262 377-454 


Example 2 


Detection of SARS-CoV in a Subject 


This example demonstrates the detection of SARS-CoV in 
patient specimens using SARS-CoV-specific primers. 


The SARS-specific primers Cor-p-F2 (SEQ ID NO: 13), 
Cor-p-F3 (SEQ ID NO: 14) and Cor-p-R1 (SEQ ID NO: 15) 
were used to test patient specimens for SARS. One primer for 
each set was 5'-end-labeled with 6-FAM to facilitate GeneS- 
can analysis. One-step amplification reactions were per- 
formed with the Access RT-PCR System (Promega, Madison, 
Wis.) as described by Falsev et al., J. Infect. Dis. 87:785-90, 
2003. Positive and negative RT-PCR controls, containing 
standardized viral RNA extracts, and nuclease-free water 
were included in each run. Amplified 6-FAM-labeled prod- 
ucts were analyzed by capillary electrophoresis on an ABI 
3100 Prism Genetic Analyzer with GeneScan software (ver- 
sion 3.1.2; Applied Biosystems; Foster City, Calif.). Speci- 
mens were considered positive for SARS-CoV if the ampli- 
fication products were within one nucleotide of the expected 
product size (368 nucleotides for Cor-p-F2 or Cor-p-R1 and 
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348 nucleotides for Cor-p-F3 or Cor-p-R1) for both specific 
primer sets, as confirmed by a second PCR reaction from 
another aliquot of RNA extract in a separate laboratory. 
Where DNA yield was sufficient, the amplified products were 
also sequenced. Additionally, as described above, microar- 
ray-based detection of SARS-CoV in patient specimens was 
carried out (Wang et al., PNAS 99:15687-92, 2002 and 
Bohlander et al., Genomics 13:1322-24, 1992). 


Example 3 


Immunohistochemical and Histopathological 
Analysis, and Electron-Microscopical Analysis of 
Bronchoalveolar Lavage Fluid 


This example illustrates immunohistochemical, histo- 
pathological and electron-microscopical analysis of Vero E6 
cells infected with the SARS-CoV and tissue samples from 
SARS patients. 

Formalin-fixed, paraffin-embedded Vero E6 cells infected 
with the SARS-CoV and tissues obtained from patients with 
SARS were stained with hematoxylin and eosin and various 
immunohistochemical stains. Immunohistochemical assays 
were based on a method described previously for hantavirus 
(Zaki et al., Amer. J. Pathol. 146:552-79, 1995). Briefly, 4-um 
sections were deparaffinized, rehydrated, and digested in Pro- 
teinase K for 15 minutes. Slides were then incubated for 60 
minutes at room temperature with monoclonal antibodies, 
polyclonal antiserum or ascitic fluids derived from animal 
species with reactivities to various known coronaviruses, and 
with a convalescent-phase serum specimen from a patient 
with SARS. 

Optimal dilutions of the primary antibodies were deter- 
mined by titration experiments with coronavirus-infected 
cells from patients with SARS and with noninfected cells or, 
when available, with concentrations recommended by the 
manufacturers. After sequential application of the appropri- 
ate biotinylated link antibody, avidin-alkaline phosphatase 
complex, and naphthol-fast red substrate, sections were coun- 
terstained in Mayer’s hematoxylin and mounted with aque- 
ous mounting medium. The following antibody and tissue 
controls were used: serum specimens from noninfected ani- 
mals, various coronavirus-infected cell cultures and animal 
tissues, noninfected cell cultures, and normal human and 
animal tissues. Tissues from patients were also tested by 
immunohistochemical assays for various other viral and bac- 
terial pulmonary pathogens. In addition, a BAL specimen was 
available from one patient for thin-section electron-micro- 
scopical evaluation. 

Lung tissues were obtained from the autopsy of three 
patients and by open lung biopsy of one patient, 14-19 days 
following onset of SARS symptoms. Confirmatory labora- 
tory evidence of infection with coronavirus was available for 
two patients (patients 6 and 17) and included PCR amplifica- 
tion of coronavirus nucleic acids from tissues, viral isolation 
from BAL fluid or detection of serum antibodies reactive with 
coronavirus (Table 1). For two patients, no samples were 
available for molecular, cell culture, or serological analysis; 
however, both patients met the CDC definition for probable 
SARS cases and had strong epidemiologic links with labora- 
tory-confirmed SARS cases. Histopathologic evaluation of 
lung tissues of the four patients showed diffuse alveolar dam- 
age at various levels of progression and severity. Changes 
included hyaline membrane formation, interstitial mono- 
nuclear inflammatory infiltrates, and desquamation of pneu- 
mocytes in alveolar spaces (FIG. 5A). Other findings identi- 
fied in some patients included focal intraalveolar 
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hemorrhage, necrotic inflammatory debris in small airways, 
and organizing pneumonia. Multinucleated syncytial cells 
were identified in the intraalveolar spaces of two patients who 
died 14 and 17 days, respectively, after onset of illness. These 
cells contained abundant vacuolated cytoplasm with cleaved 
and convoluted nuclei. No obvious intranuclear or intracyto- 
plasmic viral inclusions were identified (FIG. 5B), and elec- 
tron-microscopical examination of a limited number of these 
syncytial cells revealed no coronavirus particles. No defini- 
tive immunostaining was identified in tissues from SARS 
patients with the use of a battery of immunohistochemical 
stains reactive with coronaviruses from antigenic groups I, II, 
and III. In addition, no staining of patient tissues was identi- 
fied with the use of immunohistochemical stains for influen- 
zaviruses A and B, adenoviruses, Hendra and Nipah viruses, 
human metapneumovirus, respiratory syncytial virus, 
measles virus, Mycoplasma pneumoniae, and Chlamydia 
pneumoniae. 

Evaluation of Vero E6 cells infected with coronavirus iso- 
lated from a patient with SARS revealed viral CPE that 
included occasional multinucleated syncytial cells but no 
obvious viral inclusions (FIG. 5C). Immunohistochemical 
assays with various antibodies reactive with coronaviruses 
from antigenic group I, including HCoV-229E, FIPV and 
TGEV, and with an immune serum specimen from a patient 
with SARS, demonstrated strong cytoplasmic and membra- 
nous staining of infected cells (FIG. 5C and Table 3); how- 
ever, cross-reactivity with the same immune human serum 
sample and FIPV antigen was not observed. No staining was 
identified with any of several monoclonal or polyclonal anti- 
bodies reactive with coronaviruses in antigenic group II 
(HCoV-0C43, BCoV and MHV) or group III (TCoV and 
IBV-Avian). Electron microscopical examination of a BAL 
fluid from one patient revealed many coronavirus-infected 
cells (FIGS. 6A-B). 


TABLE 3 


Immunohistochemical reactivities of various polyclonal group I anti- 
coronavirus reference antiserum samples with a coronavirus isolated from 
a patient with SARS and with selected antigenic group I coronaviruses. 


Immunohistochemical reactivity of antiserum with 
coronavirus-infected culture cells 


HCoV-229E 
SARS-CoV (mouse 3T3- FIPV-1 
Antiserum (Vero E6) hAPN) (BHK-fAPN) 
Convalescent- + + - 
phase SARS 
(patient 3) 
Guinea pig anti- + + - 
HCoV-229E 
Rabbit anti + + + 
HCoV-229E 
Feline anti-FIPV- + + + 
1 
Porcine anti- + - + 
TGEV 
Example 4 


SARS-CoV Serologic Analysis 


This example illustrates representative methods of per- 
forming serological analysis of SARS-CoV. 

Spot slides were prepared by applying 15 ul of the suspen- 
sion of gamma-irradiated mixed infected and noninfected 
cells onto 12-well Teflon-coated slides. Slides were allowed 


US 7,776,521 BI 


37 

to air dry before being fixed in acetone. Slides were then 
stored at -70° C. until used for IFA tests (Wulff and Lange, 
Bull. WHO 52:429-36, 1975). An ELISA antigen was pre- 
pared by detergent extraction and subsequent gamma irradia- 
tion of infected Vero E6 cells (Ksiazek et al., J. Infect. Dis. 
179 (suppl. 1):S191-8, 1999). The optimal dilution (1:1000) 
for the use of this antigen was determined by checkerboard 
titration against SARS patient serum from the convalescent 
phase; a control antigen, similarly prepared from uninfected 
Vero E6 cells, was used to control for specific reactivity of 
tested sera. The conjugates used were goat antihuman IgG, 
IgA, and IgM conjugated to fluorescein isothiocyanate and 
horseradish peroxidase (Kirkegaard and Perry, Gaithersburg, 
Md.), for the IFA test and ELISA, respectively. Specificity 
and cross-reactivity of a variety of serum samples to the 
newly identified virus were evaluated by using the tests 
described herein. For this evaluation, serum from SARS 
patients in Singapore, Bangkok and Hong Kong was used, 
along with serum from healthy blood donors from the CDC 
serum bank and from persons infected with known human 
coronavirus (human coronaviruses OC43 and 229E) 
(samples provided by E. Walsh and A. Falsey, University of 
Rochester School of Medicine and Dentistry, Rochester, 
NY. 


Spot slides with infected cells reacted with serum from 
patients with probable SARS in the convalescent phase (FIG. 
1B). Screening of a panel of serum from patients with sus- 
pected SARS from Hong Kong, Bangkok, Singapore as well 
as the United States showed a high level of specific reaction 
with infected cells, and conversion from negative to positive 
reactivity or diagnostic rises in the IFA test by a factor of four. 
Similarly, tests ofthese same serum samples with the ELISA 
antigen showed high specific signal in the convalescent-phase 
samples and conversion from negative to positive antibody 
reactivity or diagnostic increases in titer (Table 4). 


TABLE 4 


Results of serological testing with both IFA assay and ELISA in SARS 
patients tested against the newly isolated human coronavirus. 














Serum Days After 
Source No. Onset ELISA Titer* IFA Titer* 
Hong Kong 1.1 4 <100 <25 
Hong Kong 1.2 13 26400 1600 
Hong Kong 2 11 400 100 
Hong Kong 2.2 16 1600 200 
Hong Kong 3.1 7 <100 <25 
Hong Kong 3.2 17 26400 800 
Hong Kong 4.1 8 <100 <25 
Hong Kong 4.2 13 1600 50 
Hong Kong EI 10 100 x25 
Hong Kong 2:2 17 26400 1600 
Hong Kong 6.1 12 1600 200 
Hong Kong 6.2 20 26400 6400 
Hong Kong 7:1 17 400 50 
Hong Kong 7.2 24 26400 3200 
Hong Kong 8. 3 <100 <25 
Hong Kong 8.2 15 26400 200 
Hong Kong (Hanoi) 9. 5 <100 <25 
Hong Kong 9.2 11 26400 1600 
Bangkok 1. 2 «100 <25 
Bangkok 1.2 4 <100 <25 
Bangkok 13 7 «100 <25 
Bangkok 1.4 15 1600 200 
United States T. 2 «100 <25 
United States 12 6 400 50 
United States 1:3 13 26400 800 
Singapore 1. 2 100 <25 
Singapore 12 11 6400 800 
Singapore 2: 6 100 <25 
Singapore 2.2 25 26400 400 
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TABLE 4-continued 


Results of serological testing with both IFA assay and ELISA in SARS 
patients tested against the newly isolated human coronavirus. 


Serum Days After 


Source No. Onset ELISA Titer* IFA Titer* 
Singapore 3.1 6 100 <25 
Singapore 3.2 14 26400 400 
Singapore 4.1 5 100 <25 
Singapore 4.2 16 1600 400 


*Reciprocal of dilution 


Information from the limited numbers of samples tested 
suggests that antibody is first detectable by IFA assay and 
ELISA between one and two weeks after the onset of symp- 
toms in the patient. IFA testing and ELISA of a panel of 384 
randomly selected serum samples (from U.S. blood donors) 
were negative for antibodies to the new coronavirus, with the 
exception of one specimen that had minimal reactivity on 
ELISA. A panel of paired human serum samples with diag- 
nostic increases (by a factor of four or more) in antibody (with 
very high titers to the homologous viral antigen in the conva- 
lescent-phase serum) to the two known human coronaviruses, 
OC43 (13 pairs) and 229E (14 pairs), showed no reactivity in 
either acute- or convalescent-phase serum with the newly 
isolated coronavirus by either the IFA test or the ELISA. 


Example 5 
Poly(A)*RNA Isolation and Northern Hybridization 


This example illustrates a representative method of North- 
ern hybrididization to detect SARS-CoV messages in Vero E6 
cells. 

Total RNA from infected or uninfected Vero E6 cells was 
isolated with Trizol reagent (Invitrogen Life Technologies, 
Carlsbad, Calif.) used according to the manufacturer’s rec- 
ommendations. Poly(A)*RNA was isolated from total RNA 
by use of the Oligotex Direct mRNA Kit (Qiagen, Inc., Santa 
Clarita, Calif.), following the instructions for the batch pro- 
tocol, followed by ethanol precipitation. RNA isolated from 1 
cm? of cells was separated by electrophoresis on a 0.9% 
agarose gel containing 3.7% formaldehyde, followed by par- 
tial alkaline hydrolysis (Ausubel et al. eds. Current Protocols 
in Molecular Biology, vol. 1, John Wiley & Sons, Inc., NY, 
N.Y., Ch. 4.9, 1996). RNA was transferred to a nylon mem- 
brane (Roche Molecular Biochemicals, Indianapolis, Ind.) by 
vacuum blotting (Bio-Rad, Hercules, Calif.) and fixed by UV 
cross-linking. The DNA template for probe synthesis was 
generated by RT-PCR amplification of SARS-CoV nt 29,083 
to 29,608 (SEQ ID NO: 1), by using a reverse primer con- 
taining a T7 RNA polymerase promoter to facilitate genera- 
tion of a negative-sense riboprobe. In vitro transcription of the 
digoxigenin-labeled riboprobe, hybridization, and detection 
of the bands were carried out with the digoxigenin system by 
using manufacturer’s recommended procedures (Roche 
Molecular Biochemicals, Indianapolis, Ind.). Signals were 
visualized by chemiluminescence and detected with x-ray 
film. 


Example 6 
SARS-CoV Genome Organization 
This example illustrates the genomic organization of the 


SARS-CoV genome, including the location of SARS-CoV 
ORFs. 
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The genome of SARS-CoV is a 29,727-nucleotide, poly- 
adenylated RNA, and 41% of the residues are G or C (range 
for published coronavirus complete genome sequences, 37% 
to 42%). The genomic organization is typical of coronavi- 
ruses, having the characteristic gene order [5'-replicase (rep), 
spike (S), envelope (E), membrane (M), nucleocapsid (N)-3'] 
and short untranslated regions at both termini (FIG. 7A, Table 
5). The SARS-CoV rep gene, which comprises approxi- 
mately two-thirds of the genome, encodes two polyproteins 
(encoded by ORF la and ORF 1b) that undergo co-transla- 
tional proteolytic processing. There are four ORFs down- 
stream of rep that encode the structural proteins, S, E, M, and 
N, which are common to all known coronaviruses. The 
hemagglutinin-esterase gene, which is present between 
ORF 1b and S in group 2 and some group 3 coronaviruses (Lai 
and Holmes, in Fields Virology, eds. Knipe and Howley, 
Lippincott Williams and Wilkins, New York, 4” edition, 
2001, Ch. 35), was not found in SARS-CoV. 

Coronaviruses also encode a number of non-structural pro- 
teins that are located between S and E, between M and N, or 
downstream of N. These non-structural proteins, which vary 
widely among the different coronavirus species, are of 
unknown function and are dispensable for virus replication 
(Lai and Holmes, in Fields Virology, eds. Knipe and Howley, 
Lippincott Williams and Wilkins, New York, 4” edition, 
2001, Ch. 35). The genome of SARS-CoV contains ORFs for 
five non-structural proteins of greater than 50 amino acids 
(FIG. 7B, Table 5). Two overlapping ORFs encoding proteins 
of 274 and 154 amino acids (termed X1 (SEQ ID NO: 5) and 
X2 (SEQ ID NO: 6), respectively) are located between S 
(SEQ ID NO: 4) and E (SEQ ID NO: 7). Three additional 
non-structural genes, X3 (SEQ ID NO: 9), X4 (SEQ ID NO: 
10), and X5 (SEQ ID NO: 11) (encoding proteins of 63, 122, 
and 84 amino acids, respectively), are located between M 
(SEQ ID NO: 8) and N (SEQ ID NO: 12). In addition to the 
five ORFs encoding the non-structural proteins described 
above, there are also two smaller ORFs between M and N, 
encoding proteins of less than 50 amino acids. Searches of the 
GenBank database (BLAST and FastA) indicated that there is 
no significant sequence similarity between these non-struc- 
tural proteins of SARS-CoV and any other proteins. 


The coronavirus rep gene products are translated from 
genomic RNA, but the remaining viral proteins are translated 
from subgenomic mRNAs that form a 3'-coterminal nested 
set, each with a 5'-end derived from the genomic 5'-leader 
sequence. The coronavirus subgenomic mRNAs are synthe- 
sized through a discontinuous transcription process, the 
mechanism of which has not been unequivocally established 
(Lai and Holmes, in Fields Virology, eds. Knipe and Howley, 
Lippincott Williams and Wilkins, New York, 4” edition, 
2001, Ch. 35; Sawicki and Sawicki, Adv. Exp. Med. Biol. 
440:215-19, 1998). The SARS-CoV leader sequence was 
mapped by comparing the sequence of 5'-RACE products 
synthesized from the N gene mRNA with those synthesized 
from genomic RNA. A sequence, AAACGAAC (nucleotides 
65-72 of SEQ ID NO: 1), was identified immediately 
upstream of the site where the N gene mRNA and genomic 
sequences diverged. This sequence was also present upstream 
of ORFla and immediately upstream of five other ORFs 
(Table 5), suggesting that it functions as the conserved core of 
the transcriptional regulatory sequence (TRS). 
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In addition to the site at the 5'-terminus of the genome, the 
TRS conserved core sequence appears six times in the 
remainder of the genome. The positions of the TRS in the 
genome of SARS-CoV predict that subgenomic mRNAs of 
8.3, 4.5, 3.4, 2.5, 2.0, and 1.7 kb, not including the poly(A) 
tail, should be produced (FIGS. 7A-B, Table 5). At least five 
subgenomic mRNAs were detected by Northern hybridiza- 
tion of RNA from SARS-CoV-infected cells, using a probe 
derived from the 3'-untranslated region (FIG. 7C). The cal- 
culated sizes of the five predominant bands correspond to the 
sizes of five of the predicted subgenomic mRNAs of SARS- 
CoV; the possibility that other, low-abundance mRNAs are 
present cannot be excluded. By analogy with other coronavi- 
ruses (Lai and Holmes, in Fields Virology, eds. Knipe and 
Howley, Lippincott Williams and Wilkins, New York, 4” 
edition, 2001, Ch. 35), the 8.3-kb and 1.7-kb subgenomic 
mRNAs are monocistronic, directing translation of S and N, 
respectively, whereas multiple proteins are translated from 
the 4.5-kb (X1, X2, and E), 3.4-kb (M and X3), and 2.5-kb 
(X4 and X5) mRNAs. A consensus TRS is not found directly 
upstream of the ORF encoding the predicted E protein, anda 
monocistronic mRNA that would be predicted to code for E 
could not be clearly identified by Northern blot analysis. It is 
possible that the 3.6-kb band contains more than one mRNA 
species or that the monocistronic mRNA for E is a low- 
abundance message. 


TABLE 5 


Locations of SARS-CoV ORFs and sizes of proteins and mRNAs 


Genome Location Predicted Size 


ORF TRS? ORF Start ORFEnd Protein (aa) mRNA (nt)? 
la 72 265 13,398 4,378 29,727 
1b 13,398 21,482 2,695 
S 21491 21492 25,256 1,255 8,308* 
X1 25,265 25,268 26,089 274 4,534* 
X2 25,689 26,150 154 
E 26,117 26,344 76 
M 26,353 26,398 27,060 221 3,446* 
X3 27,074 27,262 63 
X4 27272 27,273 27,638 122 2,527 
X5 27,778 27,864 28,115 84 2,0217 
N 28,111 28,120 29,385 422 1,688” 


“The location is the 3'-most nucleotide in the consensus TRS, AAACGAAC, 
5Not including poly(A). Predicted size is based on the position of the conserved TRS. 
“Corresponding mRNA detected by Northern blot analysis (FIG. 7C) 


“No mRNA corresponding to utilization of this consensus TRS was detected by Northern 
blot analysis (FIG. 7C) 


Example 7 
Real-Time RT-PCR Assay for SARS-CoV Detection 


This example demonstrates the use of SARS-CoV-specific 
primers and probes in a real-time RI-PCR assay to detect 
SARS-CoV in patient specimens. 

A variant of the real-time format, based on TaqMan probe 
hydrolysis technology (Applied Biosystems, Foster City, 
Calif.), was used to analyze a total of 340 clinical specimens 
collected from 246 persons with confirmed or suspected 
SARS-CoV infection. Specimens included oro- and nasopha- 
ryngeal swabs (dry and in viral transport media), sputa, nasal 
aspirates and washes, BAL, and lung tissue specimens col- 
lected at autopsy. 
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Nucleic Acid Extraction 

SARS-CoV nucleic acids were recovered from clinical 
specimens using the automated NucliSens extraction system 
(bioMerieux, Durham, N.C.). Following manufacturer's 
instructions, specimens received in NucliSens lysis buffer 
were incubated at 37° C. for 30 min with intermittent mixing, 
and 50 uL of silica suspension, provided in the extraction kit, 
was added and mixed. The contents of the tube were then 
transferred to a nucleic acid extraction cartridge and pro- 
cessed on an extractor workstation. Approximately 40-50 uL 
of total nucleic acid eluate was recovered into nuclease-free 
vials and either tested immediately or stored at -70° C. 


Primers and Probes 

Multiple primer and probe sets were designed from the 
SARS-CoV polymerase 1b (nucleic acid 13,398 to 21,482 of 
SEQ ID NO: 1) and nucleocapsid gene (nucleic acid 28,120 to 
29,385 of SEQ ID NO: 1) sequences by using Primer Express 
software version 1.5 or 2.0.0 (Applied Biosystems, Foster 
City, Calif.) with the following default settings: primer melt- 
ing temperature (T,,) set at 60? C.; probe T,, set at 10? C. 
greater than the primers at approximately 70? C.; and no 
guanidine residues permitted at the 5' probe termini. All prim- 
ers and probes were synthesized by standard phosphoramid- 
ite chemistry techniques. TaqMan probes were labeled at the 
5'-end with the reporter 6-FAM and at the 3'-end with the 
quencher Blackhole Quencher 1 (Biosearch Technologies, 
Inc., Novato, Calif.). Optimal primer and probe concentra- 
tions were determined by cross-titration of serial twofold 
dilutions of each primer against a constant amount of purified 
SARS-CoV RNA. Primer and probe concentrations that gave 
the highest amplification efficiencies were selected for fur- 
ther study (Table 6). 


TABLE 6 


um 


0 


42 


0.25 uL each of 50 uM forward and reverse primers, 6.125 uL 
of nuclease-free water, and 5 ul. of nucleic acid extract. 
Amplification was carried out in 96-well plates on an iCycler 
10 Real-Time Detection System (Bio-Rad, Hercules, Calif.). 
Thermocycling conditions consisted of 30 minutes at 48? C. 
for reverse transcription, 10 minutes at 95? C. for activation of 
the AmpliTaq Gold DNA polymerase, and 45 cycles of 15 
seconds at 95? C. and 1 minute at 60? C. Each run included 
one SARS-CoV genomic template control and at least two 
no-template controls for the extraction (to check for contami- 
nation during sample processing) and one no-template con- 
trol for the PCR-amplification step. As a control for PCR 
inhibitors, and to monitor nucleic acid extraction efficiency, 
each sample was tested by real-time RT-PCR for the presence 
ofthe human ribonuclease (RNase) P gene (GenBank Acces- 
sion No. NM 006413) by using the following primers and 
probe: forward primer 5'-AGATTTGGACCTGCGAGCG-3' 
(SEQ ID NO: 36); reverse primer 5'-GAGCGGCTGTCTC- 
CACAAGT-3' (SEQ ID NO: 37); probe 5'-TTCTGACC 
TGAAGGCTCTGCGCG-3' (SEQ ID NO: 38). The assay 
reaction was performed identically to that described above 
except that primer concentrations used were 30 uM each. 
Fluorescence measurements were taken and the threshold 
cycle (C7) value for each sample was calculated by determin- 
ing the point at which fluorescence exceeded a threshold limit 
set at the mean plus 10 standard deviations above the baseline. 
A test result was considered positive if two or more of the 
SARS genomic targets showed positive results (C, €45 
cycles) and all positive and negative control reactions gave 
expected values. 


While this disclosure has been described with an emphasis 
upon preferred embodiments, it will be obvious to those of 


Primers and probes used for real-time RT-PCR assays? 


Assay ID Primer/probe Sequence 
Primary diagnostic assay 


SARS1 
SARS2 
GCACGGTGGCAGCATTG (SEQ ID NO: 20) 


SARS3 CAAACATTGGCCGCAAATT (SEQ ID NO: 22) 


CU m Hd pd my my rai 


To confirm positive results 


N3 
CCGAAGAGCTACCCGACG (SEQ ID NO: 26) 


3-NTR 
CACCACATTTTCATCGAGGC (SEQ ID NO: 29) 





UA "d m) d ra bg ri 


CATGTGTGGCGGCTCACTATAT (SEQ ID NO: 16) 
GACACTATTAGCATAAGCAGTTGTAGCA (SEQ ID NO: 17) 
TTAAACCAGGTGGAACATCATCCGGTG (SEQ ID NO: 18) 
GGAGCCTTGAATACACCCAAAG (SEQ ID NO: 19) 


CCACATTGGCACCCGCAATCC (SEQ ID NO: 21) 
CAATGCGTGACATTCCAAAGA (SEQ ID NO: 23) 
CACAATTTGCTCCAAGTGCCTCTGCA (SEQ ID NO: 24) 


GAAGTACCATCTGGGGCTGAG (SEQ ID NO: 25) 


TACCCTCGATCGTACTCCGCGT (SEQ ID NO: 30) 
TGTAGGCACTGATTCAGGTTTTG (SEQ ID NO: 31) 
CGGCGTGGTCTGTATTTAATTTA (SEQ ID NO: 32) 
CTGCATACAACCGCTACCGTATTGGAA (SEQ ID NO: 33) 


Genomic 
Region 


RNA Pol 


Nucleocapsid 


Nucleocapsid 


Nucleocapsid 


CTCTTTCATTTTGCCGTCACCACCAC (SEQ ID NO: 27) 
AGCTCTCCCTAGCATTATTCACTG (SEQ ID NO: 28) 


3-NTR 


M protein 


“RT-PCR, reverse transcription-polymerase chain reaction; F, forward primer; R, reverse primer; P, probe; NTR, nontranslated region. 


Real-Time RT-PCR Assay 


The real-time RT-PCR assay was performed by using the 
Real-Time One-Step RT-PCR Master Mix (Applied Biosys- 
tems, Foster City, Calif.). Each 25-uL reaction mixture con- 
tained 12.5 uL of 2x Master Mix, 0.625 uL of the 40x Mul- 
tiScribe and RNase Inhibitor mix, 0.25 uL of 10 uM probe, 


60 


65 


ordinary skill in the art that variations and equivalents of the 
preferred embodiments may be used and it is intended that the 
disclosure may be practiced otherwise than as specifically 
described herein. Accordingly, this disclosure includes all 
modifications encompassed within the spirit and scope of the 
disclosure as defined by the claims below. 
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SEQUENCE LISTING 


<160> NUMBER OF SEQ ID NOS: 38 


<210> SEQ ID NO 1 
<211> LENGTH: 29727 


<212> TYPE: 


DNA 


<213> ORGANISM: Coronavirus 
<220> FEATURE: 
<221> NAME/KEY: misc_feature 


<222> LOCATION: 


(265). 


. (13398) 


<223> OTHER INFORMATION: ORF la 
<220> FEATURE: 
<221> NAME/KEY: misc_feature 


<222> LOCATION: 


(13398) .. (21482) 


<223> OTHER INFORMATION: ORF 1b 
<220> FEATURE: 
<221> NAME/KEY: misc_feature 


<222> LOCATION: 


(21492) .. (25256) 


<223> OTHER INFORMATION: ORF S 
<220> FEATURE: 
<221> NAME/KEY: misc_feature 





<222> LOCATION: 


(25268) .. (26089) 


<223> OTHER INFORMATION: ORF X1 
<220> FEATURE: 
<221> NAME/KEY: misc feature 


<222> LOCATION: 


(25689) .. (26150 


<223> OTHER INFORMATION: ORF X2 
<220> FEATURE: 
<221> NAME/KEY: misc_feature 


<222> LOCATION: 


(26117) .. (26344) 


<223> OTHER INFORMATION: ORF E 
<220> FEATURE: 
<221> NAME/KEY: misc_feature 


<222> LOCATION: 


(26398) .. (27060) 


<223> OTHER INFORMATION: ORF M 
<220> FEATURE: 
<221> NAME/KEY: misc_feature 


<222> LOCATION: 


(27074) .. (27262) 


«223» OTHER INFORMATION: ORF X3 
«220» FEATURE: 
«221» NAME/KEY: misc feature 


«222» LOCATION: 


(27273) .. (27638) 


«223» OTHER INFORMATION: ORF X4 
«220» FEATURE: 
«221» NAME/KEY: misc feature 


«222» LOCATION: 


(27864) .. (28115) 


«223» OTHER INFORMATION: ORF X5 
«220» FEATURE: 
«221» NAME/KEY: misc feature 


«222» LOCATION: 


(28120)..(29385) 


«223» OTHER INFORMATION: ORF N 


«400» SEQUENCE: 1 


ttattaggtt 


ctctaaacga 


gcagtataaa 


tctgcagact 


gtcegggtgt 


cacgtccaac 


gactctgtgg 


ctagtagagc 


cgttctgatg 


gacggcattc 


gaaaccccaa 


ggtcatagct 


tttacctacc 


actttaaaat 


caataataaa 


gettacggtt 


gaccgaaagg 


tcagtttgcc 


aagaggccct 


tggaaaaagg 


ccttaagcac 


agtacggtcg 


ttgcataccg 


atggcatcga 


caggaaaagc 


ctgtgtagct 


ttttactgtc 


tegtcegtgt 


taagatggag 


tgtccttcag 


atcggaggca 


cgtactgccc 


caatcacggc 


tagcggtata 


caatgttctt 


tctaaagtct 


caaccaacct 


gtegetegge 


gttgacaaga 


tgcagtcgat 


agccttgttc 


gttagagacg 


cgtgaacacc 


cagcttgaac 


cacaaggtcg 


acactgggag 


cttcgtaaga 


tatgacttag 


cgatctcttg 


tgcatgccta 


aacgagtaac 


catcagcata 


ttggtgtcaa 


tgctagtgcg 


tcaaaaatgg 


agccctatgt 


ttgagctggt 


tactcgtgcc 


acggtaataa 


gtgacgagct 


tagatctgtt 


gtgcacctac 


tegtecetet 


cctaggtttc 


cgagaaaaca 


tggetteggg 


cacttgtggt 


gttcattaaa 


tgcagaaatg 


acatgtggge 


gggagecggt 


tggcactgat 


60 


120 


180 


240 


300 


360 


420 


480 


540 


600 


660 


720 
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cccattgaag 
ctcactcgtg 
ccagatgggt 
tgcactcttt 
gaccatgagc 
acaccetteg 
tttgtgtttc 
actgagggtt 
aacaatatgc 
acgtgegact 
ggacctacta 
tgtcaagacc 
attgaaactc 
tatgttgget 
tcaggccata 
atactgagtc 
gttgccatca 
agtcttgatt 
aagggaaagc 
ctgtgtggtt 
gatgcagcaa 
atttctgaac 
aacagtgtca 
ttgtctaatc 
gcgaaactta 
attacaggtg 
gattgtgtaa 
gtcactatcg 
agcaagggac 
cttaaggcac 
tctgaggagg 
ttcacaaatg 
attaaggaca 
tttcgcttaa 
gaagttcaag 
gtgcttaatg 
gcatgtgttg 
aacatgggta 


ggtgaagaaa 


attatgaaca 
agctcaatgg 
accctcttga 
ccgaacaact 
atgaaattgc 
aaattaagag 
ctcttaactc 
tcatggggcg 
acttgtctac 
ttctgaaagc 
catgtgggta 
cagagattgg 
gactccgcaa 
gctataataa 
ctggcattac 
gtgaacgtgt 
ttttggcatc 
acaagtcttt 
cegtaaaagg 
ttccctcaca 
accactcaat 
agtcattacg 
ttattatggc 
ttttgggcac 
gtgcaggagt 
tttttgacat 
aatgcttcat 
ctggegcaaa 
tttaccgtca 
caaaagaagt 
ttgttctcaa 
gagctatcgt 
aagaacaata 
aagggggtge 
gttacaagaa 
aaaagtgctc 
tagcagaggc 
ttgatcttga 


acttttcatc 


45 


aaactggaac 
aggtgcagtc 
ttgcatcaaa 
tgattacatc 
ctggttcact 
tgccaagaaa 
aaaagtcaaa 
tatacgctct 
cttgatgaaa 
cacttgtgaa 
cctacctact 
acctgagcat 
gggaggtagg 
gegtgectac 
tggtgacaat 
taacattaac 
tttctctgct 
caaaaccatt 
tgcttggaac 
ggctgctggt 
tectgatttg 
tcttgtcgac 
atatgtaact 
tactgttgaa 
tgaatttctc 
cgtcaagggt 
tgatgttgtt 
gttgcgatca 
gtgtatacgt 
aacctttctt 
gaacggtgaa 
tggcacacca 
ctgcgcattg 
accaattaaa 
tgtgagaatc 
tgtctacact 
tgttgtgaag 
tgagtggagt 


acgtatgtat 


actaagcatg 
actcgctatg 
gattttctcg 
gagtcgaaga 
gagcgctctg 
tttgacactt 
gtcattcaac 
gtgtaccctg 
tgtaatcatt 
cattgtggca 
aatgctgtag 
agtgttgcag 
actagatgtt 
tgggttcctc 
gtggagacct 
attgttggcg 
tctacaagtg 
gttgagtcct 
attggacaac 
gttatcagat 
caaagagcag 
gccatggttt 
ggtggtcttg 
aaactcaggc 
aaggatgctt 
caaatacagg 
aacaaggcac 
ctcaacttag 
ggcaaggagc 
gaaggtgatt 
ctcgaagcac 
gtctgtgtaa 
tctcctggtt 
ggtgtaacct 
acatttgagc 
gttgaatccg 
actttacaac 
gtagctacat 


tgttcctttt 
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-continued 


gcagtggtgc 
tcgacaacaa 
cacgecgcggg 
gaggtgtcta 
ataagagcta 
tcaaagggga 
cacgtgttga 
ttgcatctcc 
gcgatgaagt 
ctgaaaattt 
tgaaaatgcc 
attatcacaa 
ttggaggctg 
gtgctagtgc 
tgaatgagga 
attttcattt 
cctttattga 
gcggtaacta 
agagatcagt 
caatttttgc 
ctgtcaccat 
atacttcaga 
tacaacagac 
ctatctttga 
gggagattct 
ttgcttcaga 
tcgaaatgtg 
gtgaagtctt 
agctgcaact 
cacatgacac 
tcgagacgee 
atggcctcat 
tactggctac 
ttggagaaga 
ttgatgaacg 
gtaccgaagt 
cagtttctga 
tctacttatt 


accctccaga 


actccgtgaa 
tttctgtggc 
caagtcaatg 
ctgetgeegt 
cgagcaccag 
atgcccaaag 
aaagaaaaag 
acaggagtgt 
ttcatggcag 
agttattgaa 
atgtcctgcc 
ccactcaaac 
tgtgtttgee 
tgatattgge 
tctccttgag 
gaatgaagag 
cactataaag 
taaagttacc 
tttaacacca 
gegcacactt 
acttgatggt 
cctgctcacc 
ttctcagtgg 
atggattgag 
caaatttctc 
taacatcaag 
cattgatcaa 
catcgctcaa 
actcatgcct 
agtacttacc 
egttgatage 
gctcttagag 
aaacaatgtc 
tactgtttgg 
tgttgacaaa 
tactgagttt 
tetcettacc 
tgatgatget 


tgaggaagaa 


780 


840 


900 


960 


1020 


1080 


1140 


1200 


1260 


1320 


1380 


1440 


1500 


1560 


1620 


1680 


1740 


1800 


1860 


1920 


1980 


2040 


2100 


2160 


2220 


2280 


2340 


2400 


2460 


2520 


2580 


2640 


2700 


2760 


2820 


2880 


2940 


3000 


3060 
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gaggacgatg 
acagaggatg 
gttgaggaag 
ccagaaccag 
actgacaatg 
atggtgattg 
ctcaacaagg 
ggccctctta 
ctgcatgttg 
tatgaaaatt 
ggtgctaaac 
attgcagtca 
aagcctagag 
gaggagaaat 
gatgaggtta 
gctgatatca 
tetttecttg 
acttgtgttg 
ttgaagaaag 
tatacacttg 
ccttcagaag 
gaaatgcttg 
gecataatgg 
gactatggtg 
aagctgaact 
tttaatcttg 
gtatcatcac 
tctgaggagc 
tcaggacagc 
cacactctgg 
ctaaagagtc 
aacactaatc 
ccaacatact 
aagactttct 
catactcttg 
tggaaatttc 
ttgtctagtg 
caagaggctt 
gettacagta 


ctacagcatg 


cagagtgtga 
attatcaagg 
aagaagagga 
aacctacacc 
ttgecattaa 
taaatgctgc 
caaccaatgg 
cagtaggagg 
ttggacctaa 
tcaattcaca 
cacttcagtc 
atgacaaagc 
tggaagcacc 
ctgtcgtaca 
ccacaacact 
atggtaaget 
agaaggatge 
taataccetc 
tgccagttga 
aggaagctaa 
cacctaatgc 
ctcatgctga 
caaccatcca 
tccgattctt 
ctctaaatga 
aagaggctgc 
cagatgctgt 
actttgtaga 
gtacagagtt 
agagcccegt 
tcttatccct 
tccacacaca 
tggatggtgc 
ttgtactacc 
atgagagttt 
ctcaagttgg 
ttttattagc 
attatagagc 
ataaaactgt 


ctaatttgga 
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ggaagaagaa 
tctccctctg 
agactggctg 
tgaagaacca 
atgtgttgac 
taacatacac 
tgecatgeaa 
gtettgtttg 
cctaaatgca 
ggacatctta 
tttacaagtg 
tctttatgag 
taaacaagag 
gaagcctgtc 
ggaagaaact 
ttaccatgat 
accttacatg 
caaaaaggct 
tgagtatata 
gactgctctt 
taaggaagag 
agagacaaga 
acgtaagtat 
cttttatact 
gcegcttgtc 
gcgctgtatg 
tactacatat 
aacagtttct 
aggtgttgaa 
cgagtttcat 
gegggaggtt 
gcttgtggat 
tgatgttaca 
tagtgatgac 
tcttggtagg 
tggtttaact 
acttcaacag 
cegtgctggt 
tggcgagctt 


atctgcaaag 


attgatgaaa 
gaatttggtg 
gatgatacta 
gttaatcagt 
atcgttaagg 
ctgaaacatg 
aaggagagtg 
ctttctggac 
ggtgaggaca 
cttgcaccat 
tgegtgeaga 
caggttgtca 
gagccaccaa 
gatgtgaagc 
aagtttctta 
tctcagaaca 
gtaggtgatg 
ggtggcacta 
accacgtacc 
aagaaatgca 
attctaggaa 
aaattaatgc 
aaaggaatta 
agtaaagagc 
acaatgccaa 
cgttctctta 
aatggatacc 
ttggctggct 
tttcttaagc 
cttgacggtg 
aagactataa 
atgtctatga 
aaaattaaac 
acactacgta 
tacatgtctg 
tcaattaaat 
cttgaagtca 
gatgctgcta 
ggtgatgtca 


cgagttetta 
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-continued 
cctgtgaaca tgagtacggt 
cctcagctga aacagttcga 
ctgagcaatc agagattgag 
ttactggtta tttaaaactt 
aggcacaaag tgctaatcct 
gtggtggtgt agcaggtgca 
atgattacat taagctaaat 
ataatcttgc taagaagtgt 
tccagcttct taaggcagca 
tgttgtcagc aggcatattt 
cggttcgtac acaggtttat 
tggattatct tgataacctg 
acacagaaga ttccaaaact 
caaaaattaa ggcctgcatt 
ccaataagtt actcttgttt 
tgcttagagg tgaagatatg 
ttatcactag tggtgatatc 
ctgagatgct ctcaagagct 
ctggacaagg atgtgctggt 
aatctgcatt ttatgtacta 
ctgtatcctg gaatttgaga 
ctatatgcat ggatgttaga 
aaattcaaga gggcatcgtt 
ctgtagcttc tattattacg 
ttggttatgt gacacatggt 
aagctcctgc cgtagtgtca 
tcacttcgtc atcaaagaca 
cttacagaga ttggtcctat 
gtggtgacaa aattgtgtac 
aggttctttc acttgacaaa 
aagtgttcac aactgtggac 
catatggaca gcagtttggt 
ctcatgtaaa tcatgagggt 
gtgaagcttt cgagtactac 
ctttaaacca cacaaagaaa 
gggctgataa caattgttat 
aattcaatgc accagcactt 
acttttgtgc actcatactc 
gagaaactat gacccatctt 


atgtggtgtg taaacattgt 


3120 


3180 


3240 


3300 


3360 


3420 


3480 


3540 


3600 


3660 


3720 


3780 


3840 


3900 


3960 


4020 


4080 


4140 


4200 


4260 


4320 


4380 


4440 


4500 


4560 


4620 


4680 


4740 


4800 


4860 


4920 


4980 


5040 


5100 


5160 


5220 


5280 


5340 


5400 


5460 


48 


ggtcagaaaa 
tatgataatc 
tatctagtac 
ttacagcaag 
tacactcata 
atgtcagagt 
accatcaage 
ttggatgggt 
ccaactcaac 
aaatttgetg 
tetgtcacat 
tcagcgagtt 
caggctacaa 
acaaagccag 
atggacaatc 
accatacaga 
atacttaaac 
atggctgctt 
gecttaggtt 
agtaaaattt 
tgegetaaga 
ttgttccaat 
acaactattg 
aattatgtga 
ttaagtattt 
aattttggtg 
gttactacta 
gactcccttg 
ctagacttga 
aaattctttt 
agtcatttca 
cccgtttctg 
agctatgttc 
aatcgtgcca 
gtctatgcaa 
gacacatttt 
cagtttaaaa 
gtgaaaaatg 


catccgctct 


ctactacctt 
ttaagacagg 
aacaagagtc 
gtacattctt 
taactgctaa 
acaaaggacc 
ctgtgtcgta 
attataaaaa 
cattaccaaa 
atgatttaaa 
tcttcccaga 
tcaagaaagg 
ccaagacaac 
tagatactte 
ttgettgtga 
aggaagtcat 
catcagatga 
atgtggaaaa 
taaaaacaat 
tggettatgt 
gattagcaca 
tgtgtacttt 
ctaaaaatag 
agtcacccaa 
gettaggtte 
ctccttctta 
tggatttctg 
attcttatcc 
caattttagg 
atttattagg 
tcagcaattc 
caatggttag 
atatcatgga 
cacgcgttga 
atggaggecg 
gcactggtag 
gaccaatcaa 
gegegettca 


cccattttgt 
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aacgggtgta 
tgtttccatt 
ttcttttgtt 
atgtgcgaat 
ggagaccctc 
agtgactgat 
taaactcgat 
ggataatget 
tgegagtttt 
tcaaatgaca 
cttgaatggc 
tgctaaatta 
gttcaaacca 
aaattcattt 
aagtcaacaa 
agagtgtgac 
aggtgttaaa 
cacaagcatt 
tgccactcat 
caaaccattc 
acgtgtgttt 
tactaaaagt 
tgttaagagt 
attttctaaa 
tctaatctgt 
ttgtaatggc 
tgaaggttct 
agctcttgaa 
tetggecget 
tetttcaget 
ttggetcatg 
gatgtacatc 
tggttgcacc 
gtgtacaact 
tggcttctgc 
tacattcatt 
ccctactgac 
ectctacttt 


caatttagac 


gaagctgtga 
ccatgtgtgt 
atgatgtctg 
gagtacactg 
tatcgtattg 
gttttctaca 
ggagttactt 
tactatacag 
gataatttca 
ggcttcacaa 
gatgtagtgg 
ctgcataagc 
aacacttggt 
gaagttctgg 
cccacctctg 
gtgaaaacta 
gtaacacaag 
accattaaga 
ggtattgctg 
ttaggacaag 
aacaattata 
accaattcta 
gttgctaaat 
ttgttcacaa 
gtaactgctg 
gttagagaat 
tttccttgca 
accattcagg 
gagtgggttt 
ataatgcagg 
tggtttatca 
ttctttgctt 
tcttcgactt 
attgttaatg 
aagactcaca 
agtgatgaag 
cagtcatcgt 


gacaaggetg 


aatttgagag 
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tgtatatggg tactctatct 
gtggtcgtga tgctacacaa 
caccacctge tgagtataaa 
gtaactatca gtgtggtcat 
acggagctca ccttacaaag 
aggaaacatc ttacactaca 
acacagagat tgaaccaaaa 
agcagcctat agaccttgta 
aactcacatg ttctaacaca 
agccagcttc acgagagcta 
ctattgacta tagacactat 
caattgtttg gcacattaac 
gtttacgttg tetttggagt 
cagtagaaga cacacaagga 
aagaagtagt ggaaaatcct 
ccgaagttgt aggcaatgtc 
agttaggtca tgaggatctt 
aacctaatga gctttcacta 
caattaatag tgttccttgg 
cagcaattac aacatcaaat 
tgccttatgt gtttacatta 
gaattagage ttcactacct 
tatgtttgga tgccggcatt 
tegctatgtg gctattgttg 
cttttggtgt actcttatct 
tgtatcttaa ttcgtctaac 
gcatttgttt aagtggatta 
tgacgatttc atcgtacaag 
tggcatatat gttgttcaca 
tgttctttgg ctattttgct 
ttagtattgt acaaatggca 
ctttctacta catatggaag 
gcatgatgtg ctataagcgc 
gcatgaagag atctttctat 
attggaattg tctcaattgt 
ttgctcgtga tttgtcactc 
atattgttga tagtgttgct 
gtcaaaagac ctatgagaga 


ctaacaacac taaaggttca 


5520 


5580 


5640 


5700 


5760 


5820 


5880 


5940 


6000 


6060 


6120 


6180 


6240 


6300 


6360 


6420 


6480 


6540 


6600 


6660 


6720 


6780 


6840 


6900 


6960 


7020 


7080 


7140 


7200 


7260 


7320 


7380 


7440 


7500 


7560 


7620 


7680 


7740 


7800 


ctgcctatta 
tetgettetg 
ettgtatcag 
gacacctttt 
gctcacagcg 
getgecegac 
aaactttcac 
acctataata 
gcaaggcata 
aaagactaca 
aacaacatac 
actaaaatct 
gecacattat 
ttgtcaatcc 
gtcactegtg 
gcatggttta 
getatcatta 
gcaatcaatg 
tgetacacac 
getgetgagt 
actaatttge 
cttatggatg 
gtaacaactt 
atttgectat 
ggagttttct 
caacctgtgg 
ttggtgactt 
catgttgttg 
ccagcttaca 
ttcaccaatg 
gtgccttttt 
ttctttaaca 
gaggaggctg 
gagacactgt 
tatttcagtg 
aaggctctaa 
tcaatcactt 
gttgaagggt 
gatgacacag 


aactatgaag 


atgtcatagt 
tgtactacag 
acgttggaga 
cagcaacttt 
agttagcaaa 
aaggtgttgt 
atcactctga 
aggttgaaaa 
tcaatgccca 
tgtctttatc 
cttttagact 
cactcaaggg 
tgtgegttet 
atgatggtta 
acatcatttc 
gecagegtgg 
caagagagat 
gtgacttctt 
cttccaaact 
gtacaatttt 
tagagggttc 
gttccatcat 
ttgatgctga 
ctaccagtgg 
gtggtgttga 
gtgctttaga 
gtgctgccta 
ctgctaatgc 
gctttctgcc 
atgtttcatt 
ggataacagc 
actatcttag 
ctttgtgtac 
tgccacttac 
gagccttaga 
atgactttag 
ctgctgttct 
gcatggtaca 
tatactgtcc 


atctgctcat 


51 


ttttgatgge 
tcagetgatg 
tagtactgaa 
tagtgttect 
gggtgtaget 
tgataccgat 
cttagaagtg 
catgacgece 
agtagcaaaa 
tgaacagctg 
aacttgtgct 
tggtaagatt 
tgctgcattg 
cacaaatgaa 
tactgatgat 
tggttcatac 
tggtttcata 
gcattttcta 
cattgagtat 
taaggatgct 
tatttcttat 
acagtttcct 
gtactgtaga 
tagatgggtt 
tgcgatgaat 
tgtgtctgct 
ctactttatg 
acttttgttt 
gggagtctac 
cttggctcac 
aatctatgta 
gaaaagagtc 
ctttttgctc 
acagtataac 
tactaccagc 
caactcaggt 
geagagtggt 
agtaacctgt 
aagacatgte 


tcgcaaatcc 


aagtccaaat 
tgccaaccta 
gtttccgtta 
atggaaaaac 
ttagatggtg 
gttgacacaa 
acaggtgaca 
agagatcttg 
agtcacaatg 
cgtaaacaaa 
acaactagac 
gttagtactt 
gtttgttata 
atcattggtt 
tgttttgcaa 
aaaaatgaca 
gtgcctggct 
cctcgtgttt 
agtgattttg 
atgggcaaac 
agtgagcttc 
aacacttacc 
catggtacat 
cttaataatg 
ctcatagcta 
tcagtagtgg 
aaattcagac 
ttgatgtctt 
tcagtctttt 
cttcaatggt 
ttctgtattt 
atgtttaatg 
aacaaggaaa 
aggtatcttg 
tatcgtgaag 
gctgatgttc 
tttaggaaaa 
ggaactacaa 
atttgcacag 


aaccatagct 
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-continued 
gcgacgagtc tgcttctaag 
ttctgttgct tgaccaagtt 
agatgtttga tgcttatgtc 
ttaaggcact tgttgctaca 
tcctttctac attcgtgtca 
aggatgttat tgaatgtctc 
gttgtaacaa tttcatgctc 
gcgcatgtat tgactgtaat 
tttcactcat ctggaatgta 
ttcgtagtgc tgccaagaag 
aggttgtcaa tgtcataact 
gttttaaact tatgcttaag 
tcgttatgcc agtacataca 
acaaagccat tcaggatggt 
ataaacatgc tggttttgac 
aaagctgccc tgtagtagct 
taccgggtac tgtgctgaga 
ttagtgctgt tggcaacatt 
ctacctctgc ttgegttett 
ctgtgccata ttgttatgac 
gtccagacac tcgttatgtg 
tggagggttc tgttagagta 
gcgaaaggtc agaagtaggt 
agcattacag agctctatca 
acatctttac tcctcttgtg 
ctggtggtat tattgccata 
gtgtttttgg tgagtacaac 
tcactatact ctgtctggta 
acttgtactt gacattctat 
ttgccatgtt ttctcctatt 
ctctgaagca ctgccattgg 
gagttacatt tagtaccttc 
tgtacctaaa attgcgtagc 
ctctatataa caagtacaag 
cagcttgctg ccacttagca 
tctaccaacc accacagaca 
tggcattccc gtcaggcaaa 
ctcttaatgg attgtggttg 
cagaagacat gcttaatcct 


ttcttgttca ggctggcaat 


7860 


7920 


7980 


8040 


8100 


8160 


8220 


8280 


8340 


8400 


8460 


8520 


8580 


8640 


8700 


8760 


8820 


8880 


8940 


9000 


9060 


9120 


9180 


9240 


9300 


9360 


9420 


9480 


9540 


9600 


9660 


9720 


9780 


9840 


9900 


9960 


10020 


10080 


10140 


10200 


52 


gttcaacttc 
acttctaacc 
tcagttctag 
aatcatacca 
gattatgatt 
gctggtactg 
gctgcaggta 
atcaatggtg 
gtggcaatga 
ctttctgctc 
cagaatggta 
ccatttgatg 
gttaagggca 
caaagtacac 
cttggtatta 
ttgtgettgt 
cctgctagct 
ggttataggc 
acagctcgca 
acacttgttt 
ttagttattt 
agagctatag 
ttacagtgta 
cttttctgtt 
tctacacaag 
gatgctttca 
gctactgtac 
cttcaacaac 
aatgatattc 
tctgttttgc 
gataaccgtg 
gcttatgcca 
gttctcaaaa 
gccatgcaac 
gcaagatctg 
atgcttagga 
tgtgttccac 
gattatggta 


tgggaaatcc 


gtgttattgg 
ctaagacacc 
catgctacaa 
ttaaaggttc 
gegtgtettt 
acttagaagg 
cagacacaac 
ataggtggtt 
agtacaacta 
aaacaggaat 
tgaatggtcg 
ttgttagaca 
ctcatcattg 
agtggtcact 
tggcaattgc 
ttctgttacc 
gggtgatgeg 
ttaaggattg 
ctgtttatga 
acaaagtcta 
ctgtaacctc 
tgtttgtgtg 
tcatgcttgt 
tactcaaccg 
aatttaggta 
agcttaacat 
agtctaaaat 
ttagagtaga 
ttcttgcaaa 
tatccatgca 
ctactcttca 
ctgcccagga 
agttaaagaa 
gcaagttgga 
aggacaagag 
agcttgataa 
tcaacatcat 
cctacaagaa 


agcaagttgt 


53 


ccattctatg 
caagtataaa 
tggttcacca 
tttccttaat 
ctgctatatg 
taaattctat 
cataacatta 
tcttaataga 
tgaacctttg 
tgeegtetta 
tactatcctt 
atgctctggt 
gatgctttta 
gtttttcttt 
tgcatgtgct 
ttctcttgca 
tatcatgaca 
tgttatgtat 
tgatgctgct 
ctatggtaat 
taactattct 
tgttgagtat 
ttattgtttc 
ttacttcagg 
tatgaactcc 
taagttgttg 
gtctgacgta 
gtcatcttct 
agacacaact 
gggtgctgta 
ggctattgct 
ggectatgag 
atctttgaat 
aaagatggca 
ggcaaaagta 
tgatgcactt 
accattgact 
cacttgtgat 


tgatgcggat 


caaaattgtc 
tttgteegta 
tetggtgttt 
ggatcatgtg 
catcatatgg 
ggtccatttg 
aatgttttgg 
ttcaccacta 
acacaagatc 
gatatgtgtg 
ggtagcacta 
gttaccttcc 
actttcttga 
gtttacgaga 
atgctgcttg 
acagttgctt 
tggcttgaat 
gcttcagctt 
agacgtgttt 
gctttagatc 
ggtgtegtta 
tacccattgt 
ttaggetatt 
cttactettg 
caggggettt 
ggtattggag 
aagtgcacat 
aaattgtggg 
gaagetttcg 
gacattaata 
tcagaattta 
caggctgtag 
gtggctaaat 
gatcaggcta 
actagtgcta 
aacaacatta 
acagcagcca 
ggtaacacct 


agcaagattg 
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tgcttaggct taaagttgat 
tccaacctgg tcaaacattt 
atcagtgtgc catgagacct 
gtagtgttgg ttttaacatt 
agcttccaac aggagtacac 
ttgacagaca aactgcacag 
catggctgta tgctgctgtt 
ctttgaatga ctttaacctt 
atgttgacat attgggacct 
ctgctttgaa agagctgctg 
ttttagaaga tgagtttaca 
aaggtaagtt caagaaaatt 
catcactatt gattcttgtt 
atgctttctt gccatttact 
ttaagcataa gcacgcattc 
actttaatat ggtctacatg 
tggctgacac tagcttgtct 
tagttttgct tattctcatg 
ggacactgat gaatgtcatt 
aagctatttc catgtgggcc 
cgactatcat gtttttagct 
tatttattac tggcaacacc 
gttgctgctg ctactttggc 
gtgtttatga ctacttggtc 
tgcctcctaa gagtagtatt 
gtaaaccatg tatcaaggtt 
ctgtggtact gctctcggtt 
cacaatgtgt acaactccac 
agaagatggt ttctcttttg 
ggttgtgcga ggaaatgctc 
gttctttacc atcatatgcc 
ctaatggtga ttctgaagtc 
ctgagtttga ccgtgatgct 
tgacccaaat gtacaaacag 
tgcaaacaat gctcttcact 
tcaacaatgc gcgtgatggt 
aactcatggt tgttgtccct 
ttacatatgc atctgcactc 


ttcaacttag tgaaattaac 


10260 


10320 


10380 


10440 


10500 


10560 


10620 


10680 


10740 


10800 


10860 


10920 


10980 


11040 


11100 


11160 


11220 


11280 


11340 


11400 


11460 


11520 


11580 


11640 


11700 


11760 


11820 


11880 


11940 


12000 


12060 


12120 


12180 


12240 


12300 


12360 


12420 


12480 


12540 


sd 


atggacaatt 
gctgttaaac 
gctggtacca 
aagggaggta 
ttecctaaga 
gttacagaca 
aacctaaata 
aatgctacag 
cctgctaaag 
aagatgttgt 
atggaccaag 
catccaaatc 
tgtgctaatg 
tggaaaggtt 
gcatcaacgt 
caggcactag 
gttttgcaaa 
atttattaga 
agactattta 
ttagagtaga 
tggctgattt 
aaatactcgt 
acttcgtaga 
aatcattatt 
tactgacatt 
aagtagcacc 
tectcacttt 
cacttattaa 
acegttattt 
ataggtgtat 
caagttttgg 
ctggatacca 
egegtctcag 
etggcaattt 
atgttgettt 
tgtctaaagg 
aggatggcaa 
gtgatatcag 
atggtggctg 


tcccatttaa 


caccaaattt 
tacagaataa 
cacaaacagc 
ggtttgtgct 
gtgatggtac 
caccaaaagg 
gaggtatggt 
aagtacctgc 
catataagga 
gtacacacac 
agtcctttgg 
ctaaaggatt 
acccagtggg 
atggctgtag 
ttttaaacgg 
tactgatgtc 
gttcctaaaa 
etcttacttt 
taacttggtt 
tggtgacatg 
agtctatgct 
cacatacaat 
gaatcctgac 
aaagactgta 
agataatcag 
aggctgegga 
gactagggca 


gtgggatttg 


taaatattgg 
ccttcattgt 
accactagta 
ttttcgtgag 
tttcaaggaa 
attgctagat 
tcaaactgtc 
tttctttaag 
cgctgctatc 
acaactccta 
tattaatgcc 


taaatggggt 


DI 


ggettggect 
tgaactgagt 
ttgtactgat 
ggcattacta 
aggtacaatt 
gcctaaagtg 
gctgggcagt 
caattcaact 
ttacctagca 
tggtacagga 
tggtgcttca 
ctgtgacttg 
ttttacactt 
ttgtgaccaa 
gtttgeggtg 
gtctacaggg 
actaattget 
gtagttaaga 
aaagattgte 
gtaccacata 
ctacgtcatt 
tgetgtgatg 
atcttacgcg 
caattetgeg 
gatcttaatg 
gttcctattg 
ttggctgctg 
ctgaaatatg 
gaccagacat 
gcaaacttta 
agaaaaatat 
ttaggagtcg 
cttttagtgt 
aaacgcacta 
aaacccggta 
gaaggaagtt 
agtgattatg 
ttcgtagttg 
aaccaagtaa 


aaggctagac 


cttattgtta 
ccagtagcac 
gacaatgcac 
tcagaccacc 
tacacagaac 
aaatacttgt 
ttagctgcta 
gtgctttcct 
agtggaggac 
caggcaatta 
tgttgtctgt 
aaaggtaagt 
agaaacacag 
ctccgcgaac 
taagtgcagc 
cttttgatat 
gtcgcttcca 
ggcatactat 
cagcggttgc 
tatcacgtca 
ttgatgaggg 
atgattattt 
tatatgctaa 
atgctatgcg 
ggaactggta 
tggattcata 
agtcccatat 
attttacgga 
accatcccaa 
atgtgttatt 
ttgtagatgg 
tacataatca 
atgctgctga 
catgcttttc 
attttaataa 
ctgttgaact 
actattatcg 
aagttgttga 
tegttaacaa 


tttattatga 
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-continued 
cagctctaag agccaactca 
tacgacagat gtcctgtgcg 
ttgcctacta taacaattcg 
aagatctcaa atgggctaga 
tggaaccace ttgtaggttt 
acttcatcaa aggcttaaac 
cagtacgtct tcaggctgga 
tctgtgcttt tgcagtagac 
aaccaatcac caactgtgtg 
ctgtaacacc agaagctaac 
attgtagatg ccacattgac 
acgtccaaat acctaccact 
tctgtaccgt ctgcggaatg 
ccttgatgca gtctgcggat 
cegtettaca ccgtgcggca 
ttacaacgaa aaagttgctg 
ggagaaggat gaggaaggca 
gtctaactac caacatgaag 
tgtccatgac tttttcaagt 
gcgtctaact aaatacacaa 
taattgtgat acattaaaag 
caataagaag gattggtatg 
cttaggtgag cgtgtacgcc 
tgatgcaggc attgtaggcg 
cgatttcggt gatttcgtac 
ttactcattg ctgatgccca 
ggatgctgat ctcgcaaaac 
agagagactt tgtctcttcg 
ttgtattaac tgtttggatg 
ttctactgtg tttccaccta 
tgttcctttt gttgtttcaa 
ggatgtaaac ttacatagct 
tccagctatg catgcagctt 
agtagctgca ctaacaaaca 
agacttttat gactttgctg 
aaaacacttc ttctttgctc 
ttataatctg ccaacaatgt 
taaatacttt gattgttacg 
tctggataaa tcagctggtt 


ctcaatgagt tatgaggatc 


12600 


12660 


12720 


12780 


12840 


12900 


12960 


13020 


13080 


13140 


13200 


13260 


13320 


13380 


13440 


13500 


13560 


13620 


13680 


13740 


13800 


13860 


13920 


13980 


14040 


14100 


14160 


14220 


14280 


14340 


14400 


14460 


14520 


14580 


14640 


14700 


14760 


14820 


14880 


14940 


S6 


aagatgcact 
ttaagtatge 
gtactatgac 
gagctactgt 
ctgtttacag 
gagccatgcc 
cttgctgtaa 
gtgagatggt 
atgctacaac 
taaatgcact 
aacacaggct 
agttttacgc 
tgtgctataa 
cagttcttta 
accttactaa 
atgattacgt 
tcgatgatat 
ttgatgctta 
atttacaata 
ccgtaatgct 
tgtacacacc 
ettcacttcg 
accatgtcat 
ccccaggttg 
gcaagtcaca 
tatacaaaaa 
gtgattggac 
ttttcgcagc 
ctactgtacg 
ctagaccace 
aagtacagat 
gaggtactac 
taatgccact 
tgtacccaac 
tcggcatgca 
ccatcggact 
cagctgttga 
gaatcatacc 


tagaacagta 


tttegegtat 
cattagtgca 
aaatagacag 
ggtaattgga 
tgatgtagaa 
taacatgett 
cttatcacac 
catgtgtgge 
tgettatget 
tetttcaact 
ctatgagtgt 
ttacctgegt 
cagtaactat 
ttatcaaaat 
aggacctcac 
gtacctgcct 
tgtcaaaaca 
cccacttaca 
cattagaaag 
aactaatgat 
acatacagtc 
ttgcggtgcc 
ttcaacatca 
tgatgtcact 
taagcctccc 
cacatgtgta 
taatgctggc 
agaaacgctc 
cgaagtactc 
attgaacaga 
tggagagtac 
gacatacaag 
tagtgcacct 
actcaacatc 
aaagtactct 
tgetetetat 
tgccctatgt 
tgegegtgeg 


tgttttetge 


57 


actaagcgta 
aagaatagag 
tttcatcaga 
acaagcaagt 
actccacacc 
aggataatgg 
cgtttctaca 
ggctcactat 
aatagtgtct 
gatggtaata 
ctctatagaa 
aaacatttct 
gcggctcaag 
aatgtgttca 
gaattttgct 
tacccagatc 
gatggtacac 
aaacatccta 
ttacatgatg 
aacacctcac 
ttgcaggctg 
tgtattagga 
cacaaattag 
gatgtgacac 
attagttttc 
ggcagtgaca 
gattacatac 
aaagccactg 
tctgacagag 
aactatgtct 
acctttgaaa 
ttgaatgttg 
actctagtgc 
tcagatgagt 
acactccaag 
tacccatctg 
gaaaaggcat 
egegtagagt 


actgtaaatg 


atgtcatccc 
ctegcaccegt 
aattattgaa 
tttacggtgg 
ttatgggttg 
cctctcttgt 
ggttagctaa 
atgttaaacc 
ttaacatttg 
agatagctga 
atagggatgt 
ccatgatgat 
gtttagtage 
tgtctgagge 
cacagcatac 
catcaagaat 
ttatgattga 
atcaggagta 
agettactgg 
ggtactggga 
taggtgettg 
gaccattcct 
tgttgtctgt 
aactgtatct 
cattatgtgc 
atgtcactga 
ttgccaacac 
aggaaacatt 
aattgcatct 
ttactggtta 
aaggtgacta 
gtgattactt 
cacaagagca 
tttctagcaa 
gaccacctgg 
ctcgcatagt 
taaaatattt 
gttttgataa 


cattgccaga 
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tactataact 
agctggtgtc 
gtcaatagcc 
ctggcataat 
ggattatcca 
tettgetege 
cgagtgtgcg 
aggtggaaca 
tcaagctgtt 
caagtatgtc 
tgatcatgaa 
tctttctgat 
tagcattaag 
aaaatgttgg 
aatgctagtt 
attaggcgca 
aaggttcgtg 
tgctgatgtc 
ccacatgttg 
acctgagttt 
tgtattgtgc 
atgttgcaag 
taatccctat 
aggaggtatg 
taatggtcag 
cttcaatgcg 
ttgtactgag 
taagctgtca 
ttcatgggag 
cegtgtaact 
tggtgatget 
tgtgttgaca 
ctatgtgaga 
tgttgcaaat 
tactggtaag 
gtatacggca 
gcccatagat 
attcaaagtg 


aacaactgct 


caaatgaatc 
tctatctgta 
gccactagag 
atgttaaaaa 
aaatgtgaca 
aaacataaca 
caagtattaa 
tcatccggtg 
acagccaatg 
egcaatctac 
ttcgtggatg 
gatgccgttg 
aactttaagg 
actgagactg 
aaacaaggag 
ggctgttttg 
tcactggcta 
tttcacttgt 
gacatgtatt 
tatgaggcta 
aattcacaga 
tgctgctatg 
gtttgcaatg 
agctattatt 
gtttttggtt 
atagcaacat 
agactcaagc 
tatggtattg 
gttggaaaac 
aaaaatagta 
gttgtgtaca 
tctcacactg 
attactggct 
tatcaaaagg 
agtcattttg 
tgctctcatg 
aaatgtagta 
aattcaacac 


gacattgtag 


15000 


15060 


15120 


15180 


15240 


15300 


15360 


15420 


15480 


15540 


15600 


15660 


15720 


15780 


15840 


15900 


15960 


16020 


16080 


16140 


16200 


16260 


16320 


16380 


16440 


16500 


16560 


16620 


16680 


16740 


16800 


16860 


16920 


16980 


17040 


17100 


17160 


17220 


17280 


tetttgatga 
gtgcaaaaca 
tgactaaagg 
taggtccaga 
tgagtgettt 
tcaaaatgtt 
aaataggcgt 
tctcacctta 
ctgttgattc 
cagcacactc 
ttttgtgcat 
taccacgtcg 
gtagtaagat 
taaagttcaa 
accgtagact 
atatgtttat 
tagagggctg 
tttctacagg 
cagaattcac 
cactcatgta 
gtgatacact 
agcttacatc 
acaaacgtgc 
tgggttttga 
gtaaccttca 
gttgtgatgc 
attggtctgt 
aagtacaaca 
acattggaaa 
acgatgctca 
ctacacatca 
gttacccagc 
taccaggctg 
tcgataaaag 
cttgtgagtc 
ctacgtgtat 
accgacagta 
acaaacaatt 
atgtggctta 


tttccatcat 


aatctctatg 
ctacgtctat 
cacactagaa 
catgtteett 
agtttatgac 
ctacaaaggt 
tgtaagagaa 
taattcacag 
atcacagggt 
ttgtaatgte 
aatgtctgat 
caatgtgget 
cattactggt 
gactgaagga 
catctctatg 
cacccgcgaa 
tcatgcaact 
tgttaactta 
cagagttaat 
taaaggcttg 
gaaaggattg 
aatgaagtac 
aacttgcttt 
ctatgtctat 
gagtaaccat 
tatcatgact 
tgaataccct 
catggttgtg 
tccaaaggct 
gecatgtagt 
cgataaattc 
caatgcaatt 
tgatggtggt 
tgcatttact 
tcatggcaaa 
tacacgatgc 
cttggatgca 
tgatacttat 
taatgttgtt 


taataatgct 


59 


getactaatt 
attggcgatc 
ccagaatatt 
ggaacttgte 
aataagctaa 
gttattacac 
tttcttacac 
aacgetgtag 
tetgaatatg 
aaccgettca 
agagatettt 
acattacaag 
cttcatccta 
ttatgtgttg 
atgggtttca 
gaagctattc 
agagatgctg 
gtagctgtac 
gcaaaacctc 
ccctggaatg 
tcagacagag 
tttgtcaaga 
tctacttcat 
aacccattta 
gaccaacatt 
agatgtttag 
attataggag 
aagtctgcat 
atcaagtgtg 
gacaaagctt 
actgatggtg 
gtgtgtaggt 
agtttgtatg 
aatttaaagc 
caagtagtgt 
aatttaggtg 
tataatatga 
aacctgtgga 
aataaaggac 


gtttacacaa 


atgacttgag 
ctgctcaatt 
ttaattcagt 
gcegttgtcc 
aagcacacaa 
atgatgtttc 
gcaatcctgc 
cttcaaaaat 
actatgtcat 
atgtggctat 
atgacaaact 
cagaaaatgt 
cacaggcacc 
acataccagg 
aaatgaatta 
gtcacgttcg 
tgggtactaa 
egactggtta 
caccaggtga 
tagtgegtat 
tegtgttegt 
ttggacctga 
cagatactta 
tgattgatgt 
gecaggtaca 
cagtccatga 
atgaactgag 
tgettgetga 
tgectcagge 
acaaaataga 
tttgtttgtt 
ttgacacaag 
tgaataagca 
aattgccttt 
cggatattga 
gtgctgtttg 
tgatttctgc 
atacatttac 
actttgatgg 


aggtagatgg 
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tgttgtcaat gctagacttc 
accagccccc cgcacattgc 
gtgcagactt atgaaaacaa 
tgctgaaatt gttgacactg 
ggataagtca gctcaatgct 
atctgcaatc aacagacctc 
ttggagaaaa gctgttttta 
cttaggattg cctacgcaga 
attcacacaa actactgaaa 
cacaagggca aaaattggca 
gcaatttaca agtctagaaa 
aactggactt tttaaggact 
tacacacctc agcgttgata 
cataccaaag gacatgacct 
ccaagtcaat ggttacccta 
tgcgtggatt ggctttgatg 
ectacctcte cagctaggat 
tgttgacact gaaaataaca 
ccagtttaaa catcttatac 
taagatagta caaatgctca 
cctttgggcg catggctttg 
aagaacgtgt tgtctgtgtg 
tgcctgctgg aatcattctg 
tcagcagtgg ggctttacgg 
tggaaatgca catgtggcta 
gtgctttgtt aagcgcgttg 
ggttaattct gcttgcagaa 
taagtttcca gttcttcatg 
tgaagtagaa tggaagttct 
ggagctcttc tattcttatg 
ttggaattgt aacgttgatc 
agccttgtca aacttgaact 
tgcattccac actccagctt 
cttttactat tctgatagtc 
ttatgttcca ctcaaatctg 
cagacaccat gcaaatgagt 
tggatttagc ctatggattt 
caggttacag agtttagaaa 
acacgccggc gaagcacctg 


tattgatgtg gagatctttg 


17340 


17400 


17460 


17520 


17580 


17640 


17700 


17760 


17820 


17880 


17940 


18000 


18060 


18120 


18180 


18240 


18300 


18360 


18420 


18480 


18540 


18600 


18660 


18720 


18780 


18840 


18900 


18960 


19020 


19080 


19140 


19200 


19260 


19320 


19380 


19440 


19500 


19560 


19620 


19680 
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aaaataagac 
aaccagtgec 
taatctggga 
tgactgacat 
atggtagagt 
cagaaggttc 
gagtcacatt 
gcattattca 
agcccagatc 
gatataagct 
aacttggcgg 
aattagagga 
aaacaggttc 
agataataaa 
atgctgaaat 
aactacaagc 
aaagaatgct 
aaggaataat 
ctttagctgt 
ttgcaccagg 
cagatcttaa 
tacatacggc 
atgtgacaaa 
agcaaaaact 
ctgaccttta 
atgcatcatc 
aaattgatgg 
agttgtette 
ctgtaatgtc 
gtaggcttat 
actaaacgaa 
accggtgcac 
tgaggggggt 
atttatttct 
gcaaccctgt 
ttgtcegtgg 
ttaacaattc 
tetttgetgt 


ttaattgcac 


aacacttcct 
agagattaag 
ctacaaaaga 
tgecaagaaa 
ggaaggacag 
agtcaaaggt 
aattggagaa 
acagttgect 
acaaatggaa 
cgagggetat 
tcttcattta 
ttttatccct 
atcaaaatgt 
gtcacaagat 
ttcattcatg 
aagtcaagcg 
tcttgaaaag 
gatgaatgtc 
accctacaac 
tacagctgtg 
tgacttcgtc 
taataaatgg 
agagaatgac 
agccetgggt 
caagcttatg 
atcggaagca 
ctataccatg 
ctattcactc 
tcttaaggag 
cattagagaa 
catgtttatt 
cacttttgat 
ttactatcct 
tccattttat 
catacctttt 
ttgggttttt 
tactaatgtt 
ttetaaaccc 


tttcgagtac 


61 


gttaatgttg 
atactcaata 
gaagecccag 
cctactgaga 
gtagaccttt 
ctaacacctt 
tcagtaaaaa 
gaaacctact 
actgactttc 
gecttegaac 
atgatagget 
atggacagca 
gtgtgttetg 
ttgtcagtga 
etttggtgta 
tggcaaccag 
tgtgaccttc 
gcaaagtata 
atgagagtta 
ctcagacaat 
tccgacgcag 
gaccttatta 
tctaaagaag 
ggttctatag 
ggccatttct 
tttttaattg 
catgctaact 
tttgacatga 
aatcaaatca 
aacaacagag 
ttcttattat 
gatgttcaag 
gatgaaattt 
tctaatgtta 
aaggatggta 
ggttctacca 
gttatacgag 
atgggtacac 


atatctgatg 


catttgagct 
atttgggtgt 
cacatgtatc 
gtgcttgttc 
ttagaaacgc 
caaagggacc 
cacagtttaa 
ttactcagag 
tcgagctcegc 
acatcgttta 
tagccaagcg 
cagtgaaaaa 
tgattgatct 
tttcaaaagt 
aggatggaca 
gtgttgcgat 
agaattatgg 
ctcaactgtg 
ttcactttgg 
ggttgccaac 
attctacttt 
ttagcgatat 
ggtttttcac 
ctgtaaagat 
catggtggac 
gggctaacta 
acattttctg 
gcaaatttcc 
atgatatgat 
ttgtggttte 
ttcttactct 
ctcctaatta 
ttagatcaga 
cagggtttca 
tttattttgc 
tgaacaacaa 
catgtaactt 
agacacatac 


cetttteget 
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ttgggetaag cgtaacatta 
tgatatcget gctaatactg 
tacaataggt gtetgcacaa 
ttcacttact gtettgtttg 
cegtaatggt gttttaataa 
agcacaagct agcgtcaatg 
ctactttaag aaagtagacg 
cagagactta gaggatttta 
tatggatgaa ttcatacagc 
tggagatttc agtcatggac 
ctcacaagat tcaccactta 
ttacttcata acagatgcgc 
tttacttgat gactttgtcg 
ggtcaaggtt acaattgact 
tgttgaaacc ttctacccaa 
gcctaacttg tacaagatgc 
tgaaaatgct gttataccaa 
tcaatactta aatacactta 
tgctggctct gataaaggag 
tggcacacta cttgtcgatt 
aattggagac tgtgcaacag 
gtatgaccct aggaccaaac 
ttatctgtgt ggatttataa 
aacagagcat tcttggaatg 
agcttttgtt acaaatgtaa 
tcttggcaag ccgaaggaac 
gaggaacaca aatcctatcc 
tcttaaatta agaggaactg 
ttattctctt ctggaaaaag 
aagtgatatt cttgttaaca 
cactagtggt agtgaccttg 
cactcaacat acttcatcta 
cactctttat ttaactcagg 
tactattaat catacgtttg 
tgecacagag aaatcaaatg 
gtcacagtcg gtgattatta 
tgaattgtgt gacaaccctt 
tatgatattc gataatgcat 


tgatgtttca gaaaagtcag 


19740 


19800 


19860 


19920 


19980 


20040 


20100 


20160 


20220 


20280 


20340 


20400 


20460 


20520 


20580 


20640 


20700 


20760 


20820 


20880 


20940 


21000 


21060 


21120 


21180 


21240 


21300 


21360 


21420 


21480 


21540 


21600 


21660 


21720 


21780 


21840 


21900 


21960 


22020 
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gtaattttaa 
ataagggcta 
aacctatttt 
cettttcacc 
taaagccaac 
attgttctca 
aaggaattta 
ctaatattac 
tctatgcatg 
actcaacatt 
tttgcttctc 
tagcgccagg 
tgggttgtgt 
attataaata 
atgtgccttt 
cattaaatga 
tagtactttc 
ctgaccttat 
tgttaactcc 
atttcactga 
cttttggggg 
tatatcaaga 
cagettggeg 
taggagetga 
gtgctagtta 
atactatgte 
ctactaactt 
cegtagattg 
aatatggtag 
atcgcaacac 
aatattttgg 
ggtettttat 
agcaatatgg 
tcaatggact 
ctgctctagt 
aaataccttt 
ttctctatga 
aagaatcact 
atgctcaagc 


gtgtgctaaa 


acacttacga 
tcaacctata 
taagttgcct 
tgctcaagac 
tacatttatg 
aaatccactt 
ccagacctct 
aaacttgtgt 
ggagagaaaa 
tttttcaacc 
caatgtctat 
acaaactggt 
ccttgcttgg 
taggtatctt 
ctcccctgat 
ttatggtttt 
ttttgaactt 
taagaaccag 
ttcttcaaag 
ttccgttcga 
tgtaagtgta 
tgttaactgc 
catatattct 
gcatgtcgac 
ccatacagtt 
tttaggtget 
ttcaattage 
taatatgtac 
cttttgcaca 
acgtgaagtg 
tggttttaat 
tgaggacttg 
cgaatgccta 
tacagtgttg 
tagtggtact 
tgctatgcaa 
gaaccaaaaa 
tacaacaaca 
attaaacaca 


tgatatcctt 


63 


gagtttgtgt 
gatgtagttc 
cttggtatta 
atttggggca 
ctcaagtatg 
gctgaactca 
aatttcaggg 
ccttttggag 
aaaatttcta 
tttaagtgct 
gcagattctt 
gttattgctg 
aatactagga 
agacatggca 
ggcaaacctt 
tacaccacta 
ttaaatgcac 
tgtgtcaatt 
agatttcaac 
gatcctaaaa 
attacacctg 
actgatgttt 
actggaaaca 
acttcttatg 
tctttattac 
gatagttcaa 
attactacag 
atctgcggag 
caactaaatc 
ttegetcaag 
ttttcacaaa 
ctetttaata 
ggtgatatta 
ccacctctgc 
gccactgctg 
atggcatata 
caaatcgcca 
tcaactgcat 
cttgttaaac 


tegegacttg 


ttaaaaataa 
gtgatctacc 
acattacaaa 
cgtcagctgc 
atgaaaatgg 
aatgctctgt 
ttgttccctc 
aggtttttaa 
attgtgttgc 
atggegttto 
ttgtagtcaa 
attataatta 
acattgatgc 
agcttaggcc 
gcaccccacc 
ctggcattgg 
cggccacggt 
ttaattttaa 
catttcaaca 
catctgaaat 
gaacaaatgc 
ctacagcaat 
atgtattcca 
agtgcgacat 
gtagtactag 
ttgcttactc 
aagtaatgcc 
attctactga 
gtgcactctc 
tcaaacaaat 
tattacctga 
aggtgacact 
atgctagaga 
tcactgatga 
gatggacatt 
ggttcaatgg 
accaatttaa 
tgggcaagct 
aacttagctc 


ataaagtcga 
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agatgggttt 
ttctggtttt 
ttttagagcc 
agcctatttt 
tacaatcaca 
taagagcttt 
aggagatgtt 
tgctactaaa 
tgattactct 
tgccactaag 
gggagatgat 
taaattgcca 
tacttcaact 
ctttgagaga 
tgctcttaat 
ctaccaacct 
ttgtggacca 
tggactcact 
atttggccgt 
attagacatt 
ttcatctgaa 
tcatgcagat 
gactcaagca 
tcctattgga 
ccaaaaatct 
taataacacc 
tgtttctatg 
atgtgctaat 
aggtattget 
gtacaaaacc 
ccctctaaag 
cgctgatgct 
tctcatttgt 
tatgattgct 
tggtgctggc 
cattggagtt 
caaggcgatt 
gcaagacgtt 
taattttggt 


ggcggaggta 


ctctatgttt 
aacactttga 
attcttacag 
gttggctatt 
gatgctgttg 
gagattgaca 
gtgagattcc 
ttcccttctg 
gtgctctaca 
ttgaatgatc 
gtaagacaaa 
gatgatttca 
ggtaattata 
gacatatcta 
tgttattggc 
tacagagttg 
aaattatcca 
ggtactggtg 
gatgtttctg 
tcaccttgct 
gttgctgttc 
caactcacac 
ggctgtctta 
gctggcattt 
attgtggctt 
attgctatac 
gctaaaacct 
ttgcttctcc 
gctgaacagg 
ccaactttga 
ccaactaaga 
ggcttcatga 
gcgcagaagt 
gcctacactg 
gctgctcttc 
acccaaaatg 
agtcaaattc 
gttaaccaga 
gcaatttcaa 


caaattgaca 


22080 


22140 


22200 


22260 


22320 


22380 


22440 


22500 


22560 


22620 


22680 


22740 


22800 


22860 


22920 


22980 


23040 


23100 


23160 


23220 


23280 


23340 


23400 


23460 


23520 


23580 


23640 


23700 


23760 


23820 


23880 


23940 


24000 


24060 


24120 


24180 


24240 


24300 


24360 


24420 


64 


ggttaattac 
ctgctgaaat 
gacaatcaaa 
cagecccgca 
tcaccacagc 
ttgtgtttaa 
ttactacaga 
acacagttta 
acttcaaaaa 
ctgtegtcaa 
aatcactcat 
atgtttgget 
gttgcatgac 
agtttgatga 
cgaacttatg 
aaaaattgac 
agcctcactc 
cgctaccaaa 
gttcatttgc 
tgcaggtatg 
caacgcatgt 
attactttat 
accatataac 
aaaactcaaa 
agactatgtc 
aattactaca 
agacccaccg 
aatggatcca 
aagtgagtac 
tagegtactt 
tgegettega 
ggtttacgte 
ggtctaaacg 
gcagacaacg 
gtaataggtt 
aacaggtttt 
gettgttttg 
gcaatggett 


tttgetegta 


aggcagactt 
cagggettet 
aagagttgac 
tggtgttgtc 
gecagcaatt 
tggcacttct 
caatacattt 
tgatcctctg 
tcatacatca 
cattcaaaaa 
tgaccttcaa 
cggetteatt 
tagttgttge 
ggatgactct 
gatttgttta 
aatgcttctc 
cettteggat 
ataattgcgc 
aatttactgc 
gaggcgcaat 
agaattatta 
gatgccaact 
agtgtcacag 
gaagactacc 
gttgtacatg 
gacactggta 
aatgtgcaaa 
atttatgatg 
gaacttatgt 
etttttcttg 
ttgtgtgegt 
tactegegtg 
aactaactat 
gtactattac 
tcctattcct 
tgtacataat 
tgcttgctgc 
gtattgtagg 


cccgctcaat 


65 


caaagccttc 
gctaatcttg 
ttttgtggaa 
ttcctacatg 
tgtcatgaag 
tggtttatta 
gtctcaggaa 
caacctgagc 
ccagatgttg 
gaaattgacc 
gaattgggaa 
gctggactaa 
agttgcctca 
gagccagttc 
tgagattttt 
ctgcaagtac 
ggcttgttat 
tcaataaaag 
tgctatttgt 
ttttgtacct 
tgagatgttg 
actttgtttg 
atacaattgt 
aaattggtgg 
gctatttcac 
ttgaaaatgc 
tacacacaat 
agccgacgac 
actcattcgt 
ctttcgtggt 
actgctgcaa 
ttaaaaatct 
tattattatt 
cgttgaggag 
agcctggatt 
aaagcttgtt 
tgtctacaga 
cttgatgtgg 


gtggtcattc 


aaacctatgt 
ctgctactaa 
agggctacca 
tcacgtatgt 
gcaaagcata 
cacagaggaa 
attgtgatgt 
tegactcatt 
atcttggcga 
gcctcaatga 
aatatgagca 
ttgecatcgt 
agggtgcatg 
tcaagggtgt 
tactcttgga 
tgttcatgct 
tggcgttgca 
atggcagcta 
taccatctat 
ctatgccttg 
gctttgttgg 
ctggcacaca 
egttactgaa 
ttattctgag 
cgaagtttac 
tacattcttc 
cgacggctct 
gactactagc 
ttcggaagaa 
attcttgcta 
tattgttaac 
gaactcttct 
ctgtttggaa 
cttaaacaac 
atgttactac 
ttcctctggc 
attaattggg 
cttagctact 


aacccagaaa 
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aacacaacaa 
aatgtctgag 
ccttatgtcc 
gccatcccag 
cttccctoegt 
cttcttttct 
cgttattggc 
caaagaagag 
catttcaggc 
ggtegetaaa 
atatattaaa 
catggttaca 
ctcttgtggt 
caaattacat 
tcaattactg 
acagcaacga 
tttettgetg 
gccetttata 
tcacatcttt 
atatattttc 
aagtgcaaat 
cataactatg 
ggtgacggca 
gataggcact 
taccagcttg 
atctttaaca 
tcaggagttg 
gtgeetttgt 
acaggtacgt 
gtcacactag 
gtgagtttag 
gaaggagttc 
ctttaacatt 
tectggaaca 
aatttgccta 
tcttgtggcc 
tgactggcgg 
tcgttgcttc 


caaacattct 


ctaatcaggg 
tgtgttcttg 
ttcccacaag 
gagaggaact 
gaaggtgttt 
ccacaaataa 
atcattaaca 
ctggacaagt 
attaacgett 
aatttaaatg 
tggecttggt 
atettgettt 
tcttgctgca 
tacacataaa 
cacagccagt 
taccgctaca 
tttttcagag 
agggcttcca 
tgettgtege 
tacaatgcat 
ccaagaaccc 
actactgtat 
tttcaacacc 
caggtgttaa 
agtctacaca 
agcttgttaa 
ctaatccagc 
aagcacaaga 
taatagttaa 
ccatccttac 
taaaaccaac 
ctgatcttct 
gcttatcatg 
atggaaccta 
ttctaatcgg 
agtaacactt 
gattgcgatt 
cttcaggctg 


tctcaatgtg 


24480 


24540 


24600 


24660 


24720 


24780 


24840 


24900 


24960 


25020 


25080 


25140 


25200 


25260 


25320 


25380 


25440 


25500 


25560 


25620 


25680 


25740 


25800 


25860 


25920 


25980 


26040 


26100 


26160 


26220 


26280 


26340 


26400 


26460 


26520 


26580 


26640 


26700 


26760 
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cctctccggg 
gtgatcattc 
gacctgccaa 
gegtegeage 
aactataaat 
taagtgacaa 
tatcattatg 
agtgagacaa 
acctatggag 
ttgtatttac 
tactaaaaga 
ctgacaataa 
gtactcgaca 
aagaggaggt 
ttttaatact 
ettctatttg 
ttggttttca 
gaaacttctc 
gegctgtgca 
gtaatactta 
ggcacactat 
gtggtgeget 
gagacgtact 
tcaaaccaac 
aaccagaatg 
aataatactg 
cctcgaggcc 
taccgaagag 
agatggtact 
aaagaaggca 
ggcaccegca 
ttgccaaaag 
tcatcacgta 
cctgctcgaa 
ttgaaccagc 
actaagaaat 
cagtacaacg 
ggggaccaag 
tttgctccaa 


tcgggaacat 


ggacaattgt 
gtggtcactt 
aagagatcac 
gtgtaggcac 
taaatacaga 
cagatgtttc 
aggactttca 
ttatttaagc 
ttagattatc 
atcttgcgag 
accttgccca 
atttgcacta 
tacctatcag 
tcaacaagag 
ttgcttcacc 
tgctttttag 
ctcgaaatcc 
attgttttga 
tctaataaac 
tagcactgct 
ggttcaaaca 
tatagctagg 
tgttgtttta 
gtagtgccec 
gaggacgcaa 
egtettggtt 
agggcgttcc 
ctacccgacg 
tctattacct 
tcgtatgggt 
atcctaataa 
gettctacge 
gtegeggtaa 
tggctagcgg 
ttgagagcaa 
ctgctgctga 
tcactcaagc 
acctaatcag 
gtgectctgc 


ggctgactta 


67 


gaccagaccg 
gegaatggee 
tgtggctaca 
tgattcaggt 
ccacgccggt 
atcttgttga 
ggattgctat 
ctctaactaa 
cataaaacga 
ctatatcact 
tcaggaacat 
acttgcacta 
ctgcgtgcaa 
ctctactcgc 
attaagagaa 
cetttetget 
aggatctaga 
cttgtatttc 
ctcatgtgct 
tggetttgtg 
tgcacaccta 
tgttggtacc 
aataaacgaa 
ccgcattaca 
tggggcaagg 
cacagctctc 
aatcaacacc 
agttcgtggt 
aggaactggc 
tgcaactgag 
caatgctgcc 
agagggaagc 
ttcaagaaat 
aggtggtgaa 
agtttctggt 
ggcatctaaa 
atttgggaga 
acaaggaact 
attctttgga 


tcatggagcc 


ctcatggaaa 
ggacaccccc 
tcacgaacgc 
tttgctgcat 
agcaacgaca 
cttccaggtt 
ttggaatctt 
gaagaattat 
acatgaaaat 
atcaggagtg 
acgagggcaa 
gcacacactt 
gatcagtttc 
cactttttct 
agacagaatg 
attccttgtt 
agaaccttgt 
tctatgcagt 
tgaagatcct 
ctctaggaaa 
atgttactat 
ttcatgaagg 
caaattaaaa 
tttggtggac 
ccaaaacage 
actcagcatg 
aatagtggtc 
ggtgacggca 
ccagaagctt 
ggagccttga 
accgtgctac 
agaggcggca 
tcaactcctg 
actgeceteg 
aaaggccaac 
aagcctcgcc 
egtggtccag 
gattacaaac 
atgtcacgca 


attaaattgg 
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gtgaacttgt cattggtget 
tagggcgctg tgacattaag 
tttcttatta caaattagga 
acaaccgcta ccgtattgga 
atattgcttt gctagtacag 
acaatagcag agatattgat 
gacgttataa taagttcaat 
tcggagttag atgatgaaga 
tattctcttc ctgacattga 
tgttagaggt acgactgtac 
ttcaccattt caccctcttg 
tgcttttgct tgtgctgacg 
accaaaactt ttcatcagac 
cattgttgct gctctagtat 
aatgagctca ctttaattga 
ttaataatge ttattatatt 
accaaagtct aaacgaacat 
tgcatatgca ctgtagtaca 
tgtaaggtac aacactaggg 
ggttttacct tttcatagat 
caactgtcaa gatccagctg 
tcaccaaact gctgcattta 
tgtctgataa tggaccccaa 
ccacagattc aactgacaat 
gccgacccca aggtttaccc 
gcaaggagga acttagattc 
cagatgacca aattggctac 
aaatgaaaga getcagcccc 
cacttcccta cggcgctaac 
atacacccaa agaccacatt 
aacttcctca aggaacaaca 
gtcaagcctc ttetegetee 
gcagcagtag gggaaattct 
cgetattget gctagacaga 
aacaacaagg ccaaactgtc 
aaaaacgtac tgccacaaaa 
aacaaaccca aggaaatttc 
attggccgca aattgcacaa 
ttggcatgga agtcacacct 


atgacaaaga tccacaattc 


26820 


26880 


26940 


27000 


27060 


27120 


27180 


27240 


27300 


27360 


27420 


27480 


27540 


27600 


27660 


27720 


27780 


27840 


27900 


27960 


28020 


28080 


28140 


28200 


28260 


28320 


28380 


28440 


28500 


28560 


28620 


28680 


28740 


28800 


28860 


28920 


28980 


29040 


29100 


29160 


68 


aaagacaacg 


gagcctaaaa 


aagaagcagc 


cttcaaaatt 


accacacaag 


tactcttgtg 


atctcacata 


cattttcatc 


etgectatat 


attttaatag 


<210> SEQ I 


<211> LENGT 
<212> TYPE: 


tcatactget 


aggacaaaaa 


ccactgtgac 


ccatgagtgg 


gcagatgggc 


cagaatgaat 


gcaatcttta 


gaggccacgc 


ggaagagece 


cttcttagga 


D NO 2 


H: 4382 
PRT 


69 


gaacaagcac 
gaaaaagact 
tcttcttcct 
agcttctgct 
tatgtaaacg 
tctcgtaact 
atcaatgtgt 
ggagtacgat 
taatgtgtaa 


gaatgac 


«213» ORGANISM: Coronavirus 


«400» SEQUENCE: 2 


Met Glu Ser 
L 


Ser Leu Pro 
Asp Ser Val 
35 


Gly Thr Cys 
50 


Glu Gln Pro 
65 


His Gly His 
Tyr Gly Arg 
Glu Thr Pro 


115 


Lys Gly Ala 
130 


Leu Gly Asp 
145 


Trp Asn Thr 


Leu Asn Gly 


Pro Asp Gly 
195 


Gly Lys Ser 
210 


Lys Arg Gly 
225 


Phe Thr Glu 


Ile Lys Ser 


Leu Val Le 
5 


Val Leu G1 
20 


Glu Glu Al 


Gly Leu Va 


u Gly Val 


n Val Arg 


a Leu Ser 
40 


1 Glu Leu 
55 


Tyr Val Phe Ile Lys 


70 


Lys Val Va 
85 


Ser Gly Il 
100 


1 Glu Leu 


e Thr Leu 


Ile Ala Tyr Arg Asn 


Gly Gly Hi 
Glu Leu Gl 
15 


Lys His Gl 
165 


Gly Ala Va 
180 


Tyr Pro Le 


120 


s Ser Tyr 
135 


y Thr Asp 
0 


y Ser Gly 


1 Thr Arg 


u Asp Cys 
200 


Met Cys Thr Leu Ser 


215 


Val Tyr Cys Cys Arg 


23 


Arg Ser As 
245 


0 


p Lys Ser 


Ala Lys Lys Phe Asp 


260 


attgacgcat 
gatgaagetc 
geggetgaca 
gattcaactc 
ttttcgcaat 
aaacagcaca 
aacattaggg 


cgagggtaca 


aattaatttt 


Asn Glu Lys 
10 


Asp Val Leu 
25 


Glu Ala Arg 


Glu Lys Gly 


Arg Ser Asp 


75 


Val Ala Glu 
90 


Gly Val Leu 
105 


Val Leu Leu 


Gly Ile Asp 


Pro Ile Glu 


155 


Ala Leu Arg 
170 


Tyr Val Asp 
185 


Ile Lys Asp 


Glu Gln Leu 


Asp His Glu 
235 


Tyr Glu His 
250 


Thr Phe Lys 
265 
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acaaaacatt 


agcctttgcc 


tggatgattt 


aggcataaac 


tccgtttacg 


agtaggttta 


aggacttgaa 


gtgaataatg 


agtagtgcta 


Thr His Val 


cccaccaaca 


gcagagacaa 


ctccagacaa 


actcatgatg 


atacatagtc 


gttaacttta 


agagccacca 


ctagggagag 


tccccatgtg 


Gln Leu 
15 


Val Arg Gly Phe Gly 


30 


Glu His Leu Lys Asn 


45 


Val Leu Pro 


60 


Ala Leu Ser 


Gln Leu 


Thr Asn 
80 


Met Asp Gly Ile Gln 


Val Pro His 
110 


95 


Val Gly 


Arg Lys Asn Gly Asn 


125 


Leu Lys Ser 


140 


Tyr Asp 


Asp Tyr Glu Gln Asn 


Glu Leu Thr 


Asn Asn Phe 
190 


Phe Leu Ala 


205 


Asp Tyr Ile 


220 


His Glu Ile 


Gln Thr Pro 


Gly Glu Cys 
270 


160 


Arg Glu 
175 


Cys Gly 


Arg Ala 


Glu Ser 


Ala Trp 
240 


Phe Glu 
255 


Pro Lys 


29220 


29280 


29340 


29400 


29460 


29520 


29580 


29640 


29700 


29727 


70 


Phe 


Glu 


Pro 


305 


Met 


Leu 


Gly 


Pro 


Ala 


385 


Gly 


Tyr 


Ser 


Asp 


Gly 


465 


Ser 


Lys 


Lys 


val 


Arg 


545 


Asp 


Ser 


Asn 


Thr 


Arg 


625 


Phe 


Phe 


Asp 


val 


Lys 


290 


val 


Lys 


Lys 


Pro 


Cys 


370 


Asp 


Arg 


Asn 


Gly 


Leu 


450 


Asp 


Ala 


Ser 


Gly 


Leu 


530 


Ser 


Leu 


Leu 


Ser 


Ser 


610 


Pro 


Leu 


Asp 


Cys 


Phe 


275 


Lys 


Ala 


Cys 


Ala 


Thr 


355 


Pro 


Tyr 


Thr 


Lys 


His 


435 


Leu 


Phe 


Ser 


Phe 


Lys 


515 


Thr 


Ile 


Gln 


Arg 


val 


595 


Gln 


Ile 


Lys 


Ile 


Val 
675 


Pro 


Lvs 


Ser 


Asn 


Thr 


340 


Thr 


Ala 


His 


Arg 


Arg 


420 


Thr 


Glu 


His 


Thr 


Lvs 


500 


Pro 


Pro 


Phe 


Arg 


Leu 


580 


Ile 


Trp 


Phe 


Asp 


Val 


660 


Lvs 


Leu 


Thr 


Pro 


His 


325 


Cys 


Cys 


Cys 


Asn 


Cys 


405 


Ala 


Gly 


Ile 


Leu 


Ser 


485 


Thr 


val 


Leu 


Ala 


Ala 


565 


Val 


Ile 


Leu 


Glu 


Ala 


645 


Lys 


Cys 


Asn 


Glu 


Gln 


310 


Cys 


Glu 


Gly 


Gln 


His 


390 


Phe 


Tyr 


Tle 


Leu 


Asn 


470 


Ala 


Ile 


Lys 


Cys 


Arg 


550 


Ala 


Asp 


Met 


Ser 


Trp 


630 


Trp 


Gly 


Phe 


71 


Ser 
Gly 
295 
Glu 
Asp 
His 
Tyr 
Asp 
375 
Ser 
Gly 
Trp 
Thr 
Ser 
455 
Glu 
Phe 
val 
Gly 
Gly 
535 
Thr 
val 
Ala 
Ala 
Asn 
615 
Tle 
Glu 


Gln 


Tle 


Lys 


280 


Phe 


Cys 


Glu 


Cys 


Leu 


360 


Pro 


Asn 


Gly 


Val 


Gly 


440 


Arg 


Glu 


Ile 


Glu 


Ala 


520 


Phe 


Leu 


Thr 


Met 


Tyr 


600 


Leu 


Glu 


Ile 


Ile 


Asp 
680 


Val 


Met 


Asn 


Val 


Gly 


345 


Pro 


Glu 


Ile 


Cys 


Pro 


425 


Asp 


Glu 


Val 


Asp 


Ser 


505 


Trp 


Pro 


Asp 


Ile 


Val 


585 


Val 


Leu 


Ala 


Leu 


Gln 


665 


Val 


Lys 


Gly 


Asn 


Ser 


330 


Thr 


Thr 


Ile 


Glu 


Val 


410 


Arg 


Asn 


Arg 


Ala 


Thr 


490 


Cys 


Asn 


Ser 


Ala 


Leu 


570 


Tyr 


Thr 


Gly 


Lys 


Lys 


650 


Val 


Val 


Val 


Arg 


Met 
315 


Trp 


Glu 


Asn 


Gly 


Thr 


395 


Phe 


Ala 


Val 


Val 


Ile 


475 


Ile 


Gly 


Ile 


Gln 


Ala 


555 


Asp 


Thr 


Gly 


Thr 


Leu 


635 


Phe 


Ala 


Asn 
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-continued 


Ile 


Ile 


300 


His 


Gln 


Asn 


Ala 


Pro 


380 


Arg 


Ala 


Ser 


Glu 


Asn 


460 


Tle 


Lys 


Asn 


Gly 


Ala 


540 


Asn 


Gly 


Ser 


Gly 


Thr 


620 


Ser 


Leu 


Ser 


Lys 


Gln 


285 


Arg 


Leu 


Thr 


Leu 


Val 


365 


Glu 


Leu 


Tyr 


Ala 


Thr 


445 


Tle 


Leu 


Ser 


Tyr 


Glin 


525 


Ala 


His 


Ile 


Asp 


Leu 


605 


Val 


Ala 


Ile 


Asp 


Ala 
685 


Pro 


Ser 


Ser 


Cys 


Val 


350 


Val 


His 


Arg 


Val 


Asp 


430 


Leu 


Asn 


Ala 


Leu 


Lys 


510 


Gln 


Gly 


Ser 


Ser 


Leu 


590 


Val 


Glu 


Gly 


Thr 


Asn 


670 


Leu 


Arg 


Val 


Thr 


Asp 


335 


Ile 


Lys 


Ser 


Lys 


Gly 


415 


Ile 


Asn 


Ile 


Ser 


Asp 


495 


Val 


Arg 


Val 


Ile 


Glu 


575 


Leu 


Gln 


Lys 


Val 


Gly 


655 


Ile 


Glu 


Val 


Tyr 


Leu 


320 


Phe 


Glu 


Met 


Val 


Gly 


400 


Cys 


Gly 


Glu 


Val 


Phe 
480 


Tyr 


Thr 


Ser 


Ile 


Pro 


560 


Gln 


Thr 


Gln 


Leu 


Glu 


640 


Val 


Lys 


Met 


72 
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-continued 


Cys Ile Asp Gln Val Thr Ile Ala Gly Ala Lys Leu Arg Ser Leu Asn 
690 695 700 


Leu Gly Glu Val Phe Ile Ala Gln Ser Lys Gly Leu Tyr Arg Gln Cys 
705 710 715 720 


Ile Arg Gly Lys Glu Gln Leu Gln Leu Leu Met Pro Leu Lys Ala Pro 
725 730 735 





Lys Glu Val Thr Phe Leu Glu Gly Asp Ser His Asp Thr Val Leu Thr 
740 745 750 


Ser Glu Glu Val Val Leu Lys Asn Gly Glu Leu Glu Ala Leu Glu Thr 
755 760 765 


Pro Val Asp Ser Phe Thr Asn Gly Ala Ile Val Gly Thr Pro Val Cys 
770 775 780 


Val Asn Gly Leu Met Leu Leu Glu Ile Lys Asp Lys Glu Gln Tyr Cys 
785 790 795 800 


Ala Leu Ser Pro Gly Leu Leu Ala Thr Asn Asn Val Phe Arg Leu Lys 
805 810 815 


Gly Gly Ala Pro Ile Lys Gly Val Thr Phe Gly Glu Asp Thr Val Trp 
820 825 830 


Glu Val Gln Gly Tyr Lys Asn Val Arg Ile Thr Phe Glu Leu Asp Glu 
835 840 845 


Arg Val Asp Lys Val Leu Asn Glu Lys Cys Ser Val Tyr Thr Val Glu 
850 855 860 


Ser Gly Thr Glu Val Thr Glu Phe Ala Cys Val Val Ala Glu Ala Val 
865 870 875 880 


Val Lys Thr Leu Gln Pro Val Ser Asp Leu Leu Thr Asn Met Gly Ile 
885 890 895 


Asp Leu Asp Glu Trp Ser Val Ala Thr Phe Tyr Leu Phe Asp Asp Ala 
900 905 910 


Gly Glu Glu Asn Phe Ser Ser Arg Met Tyr Cys Ser Phe Tyr Pro Pro 
915 920 925 


Asp Glu Glu Glu Glu Asp Asp Ala Glu Cys Glu Glu Glu Glu Ile Asp 
930 935 940 


Glu Thr Cys Glu His Glu Tyr Gly Thr Glu Asp Asp Tyr Gln Gly Leu 
945 950 955 960 


Pro Leu Glu Phe Gly Ala Ser Ala Glu Thr Val Arg Val Glu Glu Glu 
965 970 975 


Glu Glu Glu Asp Trp Leu Asp Asp Thr Thr Glu Gln Ser Glu Ile Glu 
980 985 990 


Pro Glu Pro Glu Pro Thr Pro Glu Glu Pro Val Asn Gln Phe Thr Gly 
995 1000 1005 


Tyr Leu Lys Leu Thr Asp Asn Val Ala Ile Lys Cys Val Asp Ile 
010 1015 1020 


Val Lys Glu Ala Gln Ser Ala Asn Pro Met Val Ile Val Asn Ala 
025 1030 1035 


Ala Asn Ile His Leu Lys His Gly Gly Gly Val Ala Gly Ala Leu 
040 1045 1050 


Asn Lys Ala Thr Asn Gly Ala Met Gln Lys Glu Ser Asp Asp Tyr 
055 1060 1065 


Ile Lys Leu Asn Gly Pro Leu Thr Val Gly Gly Ser Cys Leu Leu 
070 1075 1080 


Ser Gly His Asn Leu Ala Lys Lys Cys Leu His Val Val Gly Pro 
085 1090 1095 





Asn Leu Asn Ala Gly Glu Asp Ile Gln Leu Leu Lys Ala Ala Tyr 


Glu 


Ala 


Val 


Ala 


Pro 


Asp 


Val 


Leu 


Asp 


Gly 


Gly 


Ser 


Lys 


Gly 


Lys 


Ala 


Met 


Met 


Gly 


Phe 


Leu 


Val 


Ser 


Val 


Glu 


Asp 


00 


Asn 
15 





Lvs 
205 


Glu 
220 





Ile 
1235 


Glu 
1250 


Asp 
1265 


Lys 
1280 


Lys 
1295 


Cys 
1310 


Cys 
1325 


Lys 
1340 


Leu 
1355 


Asp 
1370 


Ile 
385 


Phe 
400 


Asn 
415 
430 
Leu 
445 
460 


Glu 
475 


Trp 
490 





Phe 


Ile 


Thr 


Tyr 


Val 


Lys 


Pro 


Glu 


Asn 


Asp 


val 


Lys 


val 


Ala 


Lys 


Glu 


Ala 


Val 


Lys 


Tyr 


Ser 


His 


Lys 


Thr 


His 


Ser 


Asn 


Phe 


Val 


Glu 


Glu 


Thr 


Lys 


Thr 


Gly 


Met 


Tle 


Ala 


Pro 


Gly 


Ser 


Glu 


His 


Arg 


Ile 


Thr 


Leu 


Gly 


Ala 


Tyr 


Phe 


Tyr 


Ser 


Gly 


Arg 





Lys 


Lys 


Ser 


Thr 


Gly 


Val 


Tyr 


Ala 


Ile 


Ala 


Ala 


Gln 


Ser 


Asn 


Phe 


Pro 


Asn 


Val 


Ser 


Gln 


Ala 


Thr 


val 


Pro 


Glu 


Lys 


Phe 


Leu 


Phe 


Ser 


Gly 


Asp 


Thr 


Phe 


Leu 


Glu 


Ile 


Glu 


Lys 


Glu 


Asn 


Ala 


Gly 


Glu 


Gly 


75 


05 


20 


Lvs 
35 


Lys 
80 


Lys 
95 





210 


Leu 
225 


Tyr 
240 





Leu 
1255 


Gly 
1270 


Thr 
1285 


Glu 
1300 


Leu 
1315 


Tyr 
1330 


Gly 
1345 


Glu 
1360 


Met 
1375 


Gly 
1390 


Glu 
1405 


Pro 
1420 


Leu 
1435 


val 
1450 


Tyr 
1465 


Thr 
1480 


Gln 
1495 


Ile 


Pro 


Val 


Met 


Gln 


Ser 


Cys 


Thr 


His 


Glu 


Asp 


Thr 


Tyr 


Glu 


Val 


Thr 


Thr 


Ala 


Ile 


Pro 


Leu 


Glu 


val 


Leu 


Val 


Arg 


Leu 


Leu 


Tyr 


Asp 


Glu 


Val 


Ile 


Asn 


Asp 


Lys 


Ile 


Glu 


Ile 


Glu 


Leu 


Val 


Arg 


Thr 


Val 


Val 


Val 


Glu 


Ser 


Thr 


Ser 


Thr 


Leu 


Gln 


Ile 


Tyr 


Glu 


Val 


Asp 


Lys 


Ser 


Asp 


Thr 


Met 


Thr 


Ala 


Pro 


Ser 


Lys 


Ile 


Asp 


Ala 


Thr 


Ala 


Val 


Ser 


Leu 


Glu 
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Ala 


Ser 


Ala 


Leu 


Pro 


Gln 


Glu 


Leu 


Glin 


Ala 


Cys 


Leu 


Thr 


Lys 


Ser 


Trp 


Leu 


Gln 


Tyr 


Ser 


Met 


Ala 


Ser 


Ser 


Ala 


Leu 


-continued 


1110 


Pro 
1125 


Leu 
1140 


Val 
1155 


Asp 
1170 


Pro 
1185 


Lys 
1200 


Val 
1215 


Leu 
1230 


Asn 
1245 


Pro 
1260 


val 
1275 


Ser 
1290 


Tyr 
1305 


Thr 
1320 


Glu 
1335 


Asn 
1350 


Met 
1365 


Arg 
1380 


Gly 
1395 


Ile 
1410 


Pro 
1425 


Arg 
1440 


Ser 
1455 


Ser 
1470 


Gly 
1485 


Gly 
1500 


Leu 


Gln 


Asn 


Asn 


Asn 


Pro 


Thr 


Leu 


Met 


Tyr 


Val 


Arg 


Pro 


Ala 


Ala 


Leu 


Pro 


Lys 


Val 


Ile 


Ile 


Cys 


Pro 


Lys 


Ser 


Val 


Leu 


Val 


Asp 


Leu 


Thr 


Val 


Thr 


Phe 


Leu 


Met 


Ile 


Ala 


Gly 


Leu 


Pro 


Arg 


Ile 


Tyr 


Arg 


Thr 


Gly 


Met 


Asp 


Thr 


Tyr 


Glu 


Ser 


Cys 


Lys 


Lys 


Glu 


Asp 


Thr 


Ala 


Arg 


val 


Pro 


Leu 


Glin 


Lys 


Asn 


Glu 


Cys 


Lys 


Phe 


Lys 


Tyr 


Arg 


Ala 


Ser 


Arg 


Phe 


76 


Leu 


Val 


Lys 


Thr 


Ser 


Ala 


Thr 


Phe 


Met 


Gly 


Ser 


Ala 


Ala 


val 


Gln 


Cys 


Ala 


Gly 


Leu 


Ala 


Tyr 


Lys 


ser 


ser 


val 


Lys 


Lys 
505 


Glu 
520 


Ser 
535 
550 


Met 
565 


Asp 
580 


Phe 
595 


Glu 
610 





Ser 
1625 


Gly 
1640 


Ser 
1655 


Pro 
1670 


Ala 
1685 


Gly 
1700 


His 
L715 


Lys 
1730 


val 
1745 


val 
1760 


val 
1775 


Glu 
790 
805 


Glu 
820 


Glu 
835 


Tyr 
850 


865 


Asp 
880 





Arg 


Phe 


Leu 


Val 


Thr 


Val 


Phe 


Tyr 


Ala 


Leu 


Val 


Ala 


Asn 


Glu 


Ala 


His 


Met 


Ser 


Gln 


Tyr 


Gly 


yr 





yr 


Asn 


Gly 


His 


Leu 


Asp 


Tyr 


Thr 


Val 


Tyr 


Leu 


Thr 


Leu 


Leu 


Phe 


Leu 


Asn 


Cys 


Tyr 


Ile 


Gln 


Lys 


Asn 


Leu 


Lys 


Thr 


Thr 


Ala 


Asp 


Leu 


Ser 


Asn 


Gly 


Lys 


Leu 


His 


Asn 


Ser 


Leu 


Gln 


Cys 


Gly 


Leu 


Gly 


Met 


Pro 


Glu 


Leu 


Tyr 


TYT 


Gly 


Thr 


Glu 


Tyr 


Lys 


Asp 


Leu 


Thr 


Gln 


Ile 


Pro 


Thr 


His 


Ile 


Ala 


Glu 


Ala 


Asp 


Glu 


Gln 


Gly 


Cys 


Ser 


Gln 


Gln 


Arg 


Pro 


Ile 


Ile 


Tyr 


77 


Ile 
510 


Gly 
525 
540 


Asn 
555 


Gln 
570 


Lys 
585 


Ser 
600 


Leu 
615 





Thr 
1630 


Lys 
1645 


Leu 
1660 


Ala 
1675 


Leu 
1690 


Val 
1705 


Ser 
1720 


Lys 
1735 


Thr 
1750 


Val 
1765 


Ser 
1780 


Gln 
1795 


Cys 
1810 


Ile 
1825 


val 
1840 


Lys 
1855 


Glu 
1870 


Thr 
1885 


val 


Glu 


Glu 


Leu 


Phe 


Pro 


Asp 


Asp 


Lys 


Trp 


Gln 


Tyr 


Ile 


Arg 


Ala 


Thr 


Leu 


Cys 


Phe 


Gly 


Gly 


Asp 


Thr 


Pro 


Pro 


Glu 


Tyr 


Val 


Val 


His 


Gly 


His 


Asp 


Glu 


Lys 


Ala 


Gln 


Tyr 


Leu 


Glu 


Lys 


Thr 


Ser 


Gly 


val 


Thr 


His 


Gly 


Asp 


Val 


Lys 


Gln 


His 


Leu 


Lys 


Thr 


Pro 


Val 


Thr 


Ser 


Trp 


Asp 


Leu 


Arg 


Ala 


Thr 


Arg 


Thr 


Tyr 


Arg 


Met 


Phe 


Tyr 


Ala 


Val 


Ser 


Leu 


Pro 
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Thr 


Ser 


Thr 


Gln 


Thr 


Asn 


Leu 


Phe 


Lys 


Asn 


Glu 


Ala 


Tyr 


Met 


val 


Leu 


Asp 


Asp 


Met 


Leu 


Thr 


His 


Phe 


Tyr 


Asp 


Ile 


-continued 


Leu 
1515 


Leu 
1530 


Ile 
1545 


Leu 
1560 


Tvr 
1575 
His 
1590 


Arg 
1605 


Leu 
1620 


Phe 
1635 


Asn 
1650 


Val 
1665 


Arg 
1680 


Ser 
1695 


Thr 
1710 


Leu 
1725 


Thr 
1740 


Asn 
1755 


Ala 
1770 


Ser 
1785 


Cvs 
1800 
His 
1815 


Leu 
1830 


Tvr 
1845 


Lvs 
1860 


Gly 
1875 


Asp 
1890 


Glu 


Asp 


Lys 


Val 


Leu 


Glu 


Ser 


Gly 


Pro 


Cys 


Lys 


Ala 


Asn 


His 


Asn 


Gly 


Leu 


Thr 


Ala 


Ala 


Ile 


Thr 


Lys 


Leu 


Tyr 


Leu 


Ser 


Lys 


Val 


Asp 


Asp 


Gly 


Glu 


Arg 


Glin 


Tyr 


Phe 


Gly 


Lys 


Leu 


Val 


Val 


Lys 


Gln 


Pro 


Asn 


Thr 


Lys 


Glu 


Asp 


Tyr 


Val 


Pro 


Leu 


Phe 


Met 


Gly 


Lys 


Ala 


Tyr 


val 


Leu 


Asn 


Asp 


Thr 


Leu 


Val 


Glu 


Thr 


Tyr 


Pro 


Glu 


Ala 


Met 


Thr 


Gly 


Lys 


Pro 


78 


Thr 


Cys 


Phe 


Asp 


Ala 


Trp 


Thr 


Ser 


Asp 


val 


Lys 


Glu 


Ala 


Glu 


Ile 


val 


Ala 


val 


Asn 


ser 





Leu 


Ile 


Gln 
895 


Ser 
910 
925 


Leu 
940 


Ser 
955 
His 
970 


Trp 
985 





Asn 
2000 


Asn 
2015 


Glu 
2030 


Thr 
2045 


Gly 
2060 


Ala 
2075 


Leu 
2090 


Ala 
2105 


Lys 
2120 


Lys 
2135 


Phe 
2150 


Ser 
2165 


val 
2180 


val 
2195 


Leu 
2210 


Ala 
2225 


Cys 
2240 


Thr 
2255 


Ser 
2270 


Gln 


Pro 


Asn 


Lys 


Asn 


Phe 


Ile 


Cys 


Ser 


Leu 


Asn 


Thr 


val 


Tyr 


Ser 


Ala 


Pro 


Arg 


Thr 


Arg 


Lys 


Lys 


Leu 


Ala 


Asn 


Met 


Gly 


Val 


Leu 


Thr 


Pro 


Gly 


Lys 


Asn 


Leu 


Phe 


Ala 


Pro 


Glu 


Lys 


val 


Leu 


Ile 


Phe 


Leu 


Leu 


Ile 


Ser 


Ser 


Leu 


Phe 


Gly 


Asp 


Leu 


Thr 


Pro 


Lys 


Ala 


Asp 


Lys 


Gln 


Arg 


Glu 


Cys 


Thr 


Val 


Val 


Glu 


Ala 


Asn 


Leu 


Ala 


Leu 


Arg 


val 


Pro 


Leu 


Gly 


val 


Phe 


Asp 
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88 


Asp 


Val 


Ile 


Gln 


Tyr 


Asp 


Ala 


Glu 


Arg 


Met 


Asn 


Ile 


Tyr 


Ala 


Lys 


Leu 


val 


Met 


Asn 


Leu 


Pro 


Pro 


Tyr 


Val 


Ala 


Phe 


Ile 
3875 


Ser 
3890 


Asn 
3905 


Ala 
3920 


Ala 
3935 


Ser 
3950 


Lys 
3965 


Lys 
3980 


Ser 
3995 


Leu 
4010 


Ile 
4025 


Pro 
4040 


Gly 
4055 


Ser 
4070 


Ile 
4085 


Ala 
4100 


Lys 
4115 


Ser 
4130 


Lys 


Cys 





Leu 
4205 


Leu 
4220 


Thr 
4235 


Ala 
4250 


Leu 


Leu 


Arg 


Ile 


Thr 


Glu 


Ser 


Met 


Glu 


Phe 


Ile 


Leu 


Thr 


Ala 


Val 


Trp 


Leu 


Cys 


Leu 


Leu 


Ser 


Arg 


Tyr 


Gly 


Glu 


Val 


Leu 


Leu 


Leu 


Ala 


Ala 


Val 


Glu 


Ala 


Asp 


Thr 


Asn 


Thr 


Tyr 


Leu 


Gln 


Pro 


Gln 


Ala 


Ala 


Leu 


Asp 


Phe 


Phe 


Ser 


Val 


Asp 


Ala 


Ser 


Cys 


Ser 


Gln 


val 


Phe 


Asp 


Lys 


Met 


Asn 


Thr 


Lys 


Trp 


Leu 


Leu 


Asn 


Ala 


Tyr 


Ser 


Gly 


Val 


Ile 


Leu 


Pro 


Pro 


Lys 


val 


Glu 


Glu 


Glu 


Leu 


Asp 


Gln 


Arg 


Leu 


Ala 


Ala 


Asn 


Glu 


Ser 


Ile 


Asn 


Gly 


Tyr 


Asp 


Thr 


Thr 


Lys 


Ala 


Ala 


Ala 


89 


Asp 
3880 


Leu 
3895 


Glu 
3910 


Phe 
3925 


Ala 
3940 


Lys 
3955 


Arg 
3970 


Ala 
3985 


Ala 
4000 


Arg 
4015 


Arg 
4030 


Ala 
4045 


Thr 
4060 


Ile 
4075 


Glu 
4090 


Val 
4105 


Glu 
4120 


Thr 
4135 


Asn 
4150 
His 
4165 


Gly 
4180 


Asp 
4195 


Gly 
4210 


Ala 
4225 


Asn 
4240 


Lys 
4255 


Thr 


Leu 


Met 


Ser 


Tyr 


Lys 


Asp 


Met 


Lys 


Lys 


Asp 


Lys 


Cys 


Gln 


Ile 


Thr 


Leu 


Thr 


Asn 


Gln 


Thr 


Thr 


Leu 


Thr 


Ser 


Ala 


Thr 


Ser 


Leu 


Ser 


Glu 


Leu 


Ala 


Thr 


val 


Leu 


Gly 


Leu 


Asp 


Gln 


Asn 


Ala 


Ser 


Gln 


Ser 


Asp 


Ile 


Pro 


Asn 


Val 


Thr 


Tyr 


Glu 


Met 


Asp 


Leu 


Gln 


Lys 


Ala 


Gln 


Thr 


Asp 


Cys 


Met 


Gly 


Val 


Met 


Leu 


Pro 


Thr 


Lys 


Leu 


Tyr 


Lys 


Asn 


Arg 


Val 


Lys 
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Ala 


Gln 


Asn 


Pro 


Ala 


Lys 


Met 


Met 


Ser 


Asn 


val 


val 


Asn 


val 


Asp 


Arg 


val 


Ala 


Gly 


Lys 


Thr 


Gly 


Leu 


Leu 


Leu 


Asp 
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Phe 
3885 


Gly 
3900 


Arg 
3915 


Ser 
3930 


Val 
3945 


Ser 
3960 


Gln 
3975 


Tyr 
3990 


Ala 
4005 


Asp 
4020 


Pro 
4035 


val 
4050 


Thr 
4065 


Asp 
4080 


Asn 
4095 


Ala 
4110 


Ala 
4125 


Cys 
4140 


Gly 
4155 


Trp 
4170 


Glu 
4185 


Pro 
4200 


Asn 
4215 


Gln 
4230 


Ser 
4245 


Tyr 
4260 


Glu 


Ala 


Ala 


Tyr 


Ala 


Leu 


Lys 


Met 


Ala 


Leu 


Val 


Phe 


Ala 


Ser 


Asn 


Leu 


Thr 


Ala 


Leu 


Lys 


Ala 


Phe 


Leu 


Lys 


Val 


Thr 


Ala 


Asn 


Asn 


Lys 


Gln 


Glin 


Leu 


Asn 


Pro 


Thr 


Asp 


Pro 


Ser 


Arg 


Asp 


Phe 


Arg 


Glu 


Val 


Gly 


Gly 


Cys 


Ala 


Met 


Asp 


Leu 


Ala 


Gly 


Val 


Leu 


Ala 


Thr 


Asn 


Ile 


Asp 


Tyr 


Ser 


Asn 


Ala 


Gln 


Asp 


Val 


Phe 


Pro 


Lys 


Met 


Asn 


Ala 


Ser 


90 
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Gly Gly Gln Pro Ile Thr Asn Cys Val Lys Met Leu Cys Thr His 
4265 4270 4275 


Thr Gly Thr Gly Gln Ala Ile Thr Val Thr Pro Glu Ala Asn Met 
4280 4285 4290 


Asp Gln Glu Ser Phe Gly Gly Ala Ser Cys Cys Leu Tyr Cys Arg 
4295 4300 4305 


Cys His Ile Asp His Pro Asn Pro Lys Gly Phe Cys Asp Leu Lys 
4310 4315 4320 


Gly Lys Tyr Val Gln Ile Pro Thr Thr Cys Ala Asn Asp Pro Val 
4325 4330 4335 


Gly Phe Thr Leu Arg Asn Thr Val Cys Thr Val Cys Gly Met Trp 
4340 4345 4350 


Lys Gly Tyr Gly Cys Ser Cys Asp Gln Leu Arg Glu Pro Leu Met 
4355 4360 4365 


Gln Ser Ala Asp Ala Ser Thr Phe Leu Asn Gly Phe Ala Val 
4370 4375 4380 


<210> SEQ ID NO 3 

<211> LENGTH: 2695 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 3 


Arg Val Cys Gly Val Ser Ala Ala Arg Leu Thr Pro Cys Gly Thr Gly 
D 5 10 15 


Thr Ser Thr Asp Val Val Tyr Arg Ala Phe Asp Ile Tyr Asn Glu Lys 
20 25 30 


Val Ala Gly Phe Ala Lys Phe Leu Lys Thr Asn Cys Cys Arg Phe Gln 
35 40 45 


Glu Lys Asp Glu Glu Gly Asn Leu Leu Asp Ser Tyr Phe Val Val Lys 
50 55 60 


Arg His Thr Met Ser Asn Tyr Gln His Glu Glu Thr Ile Tyr Asn Leu 
65 70 75 80 


Val Lys Asp Cys Pro Ala Val Ala Val His Asp Phe Phe Lys Phe Arg 
85 90 95 


Val Asp Gly Asp Met Val Pro His Ile Ser Arg Gln Arg Leu Thr Lys 
100 105 110 


Tyr Thr Met Ala Asp Leu Val Tyr Ala Leu Arg His Phe Asp Glu Gly 
115 120 125 


Asn Cys Asp Thr Leu Lys Glu Ile Leu Val Thr Tyr Asn Cys Cys Asp 
130 135 140 


Asp Asp Tyr Phe Asn Lys Lys Asp Trp Tyr Asp Phe Val Glu Asn Pro 
145 150 155 160 


Asp Ile Leu Arg Val Tyr Ala Asn Leu Gly Glu Arg Val Arg Gln Ser 
165 170 175 


Leu Leu Lys Thr Val Gln Phe Cys Asp Ala Met Arg Asp Ala Gly Ile 
180 185 190 


Val Gly Val Leu Thr Leu Asp Asn Gln Asp Leu Asn Gly Asn Trp Tyr 
195 200 205 


Asp Phe Gly Asp Phe Val Gln Val Ala Pro Gly Cys Gly Val Pro Ile 
210 215 220 


Val Asp Ser Tyr Tyr Ser Leu Leu Met Pro Ile Leu Thr Leu Thr Arg 
225 230 235 240 


Ala Leu Ala Ala Glu Ser His Met Asp Ala Asp Leu Ala Lys Pro Leu 
245 250 255 


Ile 


Leu 


Cys 


Asn 


305 


Val 


Tyr 


His 


Pro 


Thr 


385 


Val 


Lys 


Phe 


Tyr 


Glu 


465 


Ala 


Phe 


Glu 


Thr 


Ala 


545 


Gln 


Thr 


Leu 


Asp 


Ala 


625 


His 


Met 


Lys 


Phe 


Ile 


290 


Val 


Arg 


His 


Ser 


Ala 


370 


Cys 


Lys 


Gly 


Ala 


Asn 


450 


Val 


Asn 


Asn 


Asp 


Ile 


530 


Arg 


Phe 


val 


Lys 


Tyr 


610 


Ser 


Arg 


val 


Trp 


Asp 


275 


Asn 


Leu 


Lys 


Phe 


Ser 


355 


Met 


Phe 


Pro 


Phe 


Gln 


435 


Leu 


Val 


Gln 


Lvs 


Gln 


515 


Thr 


Thr 


His 


Val 


Thr 


595 


Pro 


Leu 


Phe 


Met 


Asp 


260 


Arg 


Cys 


Phe 


Ile 


Arg 


340 


Arg 


His 


Ser 


Gly 


Phe 


420 


Asp 


Pro 


Asp 


Val 


Trp 


500 


Asp 


Gln 


val 


Gln 


Ile 


580 


val 


Lys 


val 


Tyr 


Cys 
660 


Leu 


Tyr 


Leu 


Ser 


Phe 


325 


Glu 


Leu 


Ala 


val 


Asn 


405 


Lys 


Gly 


Thr 


Lys 


Ile 


485 


Gly 


Ala 


Met 


Ala 


Lys 


565 


Gly 


Tyr 


Cys 


Leu 


Arg 


645 


Gly 


Leu 


Phe 


Asp 


Thr 


310 


Val 


Leu 


Ser 


Ala 


Ala 


390 


Phe 


Glu 


Asn 


Met 


Tyr 


470 


val 


Lys 


Leu 


Asn 


Gly 


550 


Leu 


Thr 


Ser 


Asp 


Ala 


630 


Leu 


Gly 


93 


Lys 
Lys 
Asp 
295 
Val 
Asp 
Gly 
Phe 
Ser 
375 
Ala 
Asn 
Gly 
Ala 
Cys 
455 
Phe 
Asn 
Ala 
Phe 
Leu 
535 
Val 
Leu 
Ser 
Asp 
Arg 
615 
Arg 


Ala 


Ser 


Tyr 


Tyr 


280 


Arg 


Phe 


Gly 


Val 


Lys 


360 


Gly 


Leu 


Lys 


Ser 


Ala 


440 


Asp 


Asp 


Asn 


Arg 


Ala 


520 


Lys 


Ser 


Lys 


Lys 


Val 


600 


Ala 


Lys 


Asn 


Leu 


Asp 


265 


Trp 


Cys 


Pro 


Val 


Val 


345 


Glu 


Asn 


Thr 


Asp 


Ser 


425 


Ile 


Ile 


Cys 


Leu 


Leu 


505 


Tyr 


Tyr 


Ile 


Ser 


Phe 


585 


Glu 


Met 


His 


Glu 


Tyr 
665 


Phe 


Asp 


Ile 


Pro 


Pro 


330 


His 


Leu 


Leu 


Asn 


Phe 


410 


Val 


Ser 


Arg 


Tyr 


Asp 


490 


Tyr 


Thr 


Ala 


Cys 


Ile 
570 


Tyr 


Thr 


Pro 


Asn 


Cys 


650 


Val 


Thr 


Gln 


Leu 


Thr 


315 


Phe 


Asn 


Leu 


Leu 


Asn 
395 


Tyr 


Glu 


Asp 


Gln 


Asp 


475 


Lys 


Tyr 


Lys 


Ile 


Ser 


555 


Ala 


Gly 


Pro 


Asn 


Thr 


635 


Ala 


Lys 
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Glu 


Thr 


His 


300 


Ser 


Val 


Gln 


Val 


Leu 


380 


val 


Asp 


Leu 


Tyr 


Leu 


460 


Gly 


Ser 


Asp 


Arg 


Ser 


540 


Thr 


Ala 


Gly 


His 


Met 


620 


Cys 


Gln 


Pro 


Glu 


Tyr 


285 


Cys 


Phe 


Val 


Asp 


Tyr 


365 


Asp 


Ala 


Phe 


Lys 


Asp 


445 


Leu 


Gly 


Ala 


Ser 


Asn 


525 


Ala 


Met 


Thr 


Trp 


Leu 


605 


Leu 


Cys 


Val 


Gly 


Arg 


270 


His 


Ala 


Gly 


Ser 


Val 


350 


Ala 


Lys 


Phe 


Ala 


His 


430 


Tyr 


Phe 


Cys 


Gly 


Met 


510 


Val 


Lys 


Thr 


Arg 


His 


590 


Met 


Arg 


Asn 


Leu 


Gly 
670 


Leu 


Pro 


Asn 


Pro 


Thr 


335 


Asn 


Ala 


Arg 


Gln 


Val 


415 


Phe 


Tyr 


Val 


Ile 


Phe 


495 


Ser 


Ile 


Asn 


Asn 


Gly 


575 


Asn 


Gly 


Ile 


Leu 


Ser 


655 


Thr 


Cys 


Asn 


Phe 


Leu 


320 


Gly 


Leu 


Asp 


Thr 


Thr 


400 


Ser 


Phe 


Arg 


Val 


Asn 


480 


Pro 


Tyr 


Pro 


Arg 


Arg 


560 


Ala 


Met 


Trp 


Met 


Ser 


640 


Glu 


Ser 


94 
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Ser Gly Asp Ala Thr Thr Ala Tvr Ala Asn Ser Val Phe Asn Ile Cys 
675 680 685 


Gln Ala Val Thr Ala Asn Val Asn Ala Leu Leu Ser Thr Asp Gly Asn 
690 695 700 


Lys Ile Ala Asp Lys Tyr Val Arg Asn Leu Gln His Arg Leu Tyr Glu 
705 710 715 720 


Cys Leu Tyr Arg Asn Arg Asp Val Asp His Glu Phe Val Asp Glu Phe 
725 730 735 


Tyr Ala Tyr Leu Arg Lys His Phe Ser Met Met Ile Leu Ser Asp Asp 
740 745 750 


Ala Val Val Cys Tyr Asn Ser Asn Tyr Ala Ala Gln Gly Leu Val Ala 
755 760 765 


Ser Ile Lys Asn Phe Lys Ala Val Leu Tyr Tyr Gln Asn Asn Val Phe 
770 775 780 


Met Ser Glu Ala Lys Cys Trp Thr Glu Thr Asp Leu Thr Lys Gly Pro 
785 790 795 800 


His Glu Phe Cys Ser Gln His Thr Met Leu Val Lys Gln Gly Asp Asp 
805 810 815 


Tyr Val Tyr Leu Pro Tyr Pro Asp Pro Ser Arg Ile Leu Gly Ala Gly 
820 825 830 


Cys Phe Val Asp Asp Ile Val Lys Thr Asp Gly Thr Leu Met Ile Glu 
835 840 845 


Arg Phe Val Ser Leu Ala Ile Asp Ala Tyr Pro Leu Thr Lys His Pro 
850 855 860 


Asn Gln Glu Tyr Ala Asp Val Phe His Leu Tyr Leu Gln Tyr Ile Arg 
865 870 875 880 


Lys Leu His Asp Glu Leu Thr Gly His Met Leu Asp Met Tyr Ser Val 
885 890 895 


Met Leu Thr Asn Asp Asn Thr Ser Arg Tyr Trp Glu Pro Glu Phe Tyr 
900 905 910 


Glu Ala Met Tyr Thr Pro His Thr Val Leu Gln Ala Val Gly Ala Cys 
915 920 925 


Val Leu Cys Asn Ser Gln Thr Ser Leu Arg Cys Gly Ala Cys Ile Arg 
930 935 940 


Arg Pro Phe Leu Cys Cys Lys Cys Cys Tyr Asp His Val Ile Ser Thr 
945 950 955 960 


Ser His Lys Leu Val Leu Ser Val Asn Pro Tyr Val Cys Asn Ala Pro 
965 970 975 


Gly Cys Asp Val Thr Asp Val Thr Gln Leu Tyr Leu Gly Gly Met Ser 
980 985 990 
Tyr Tyr Cys Lys Ser His Lys Pro Pro Ile Ser Phe Pro Leu Cys Ala 
995 1000 1005 


Asn Gly Gln Val Phe Gly Leu Tyr Lys Asn Thr Cys Val Gly Ser 
010 1015 1020 


Asp Asn Val Thr Asp Phe Asn Ala Ile Ala Thr Cys Asp Trp Thr 
025 1030 1035 


Asn Ala Gly Asp Tyr Ile Leu Ala Asn Thr Cys Thr Glu Arg Leu 
040 1045 1050 


Lys Leu Phe Ala Ala Glu Thr Leu Lys Ala Thr Glu Glu Thr Phe 
055 1060 1065 


Lys Leu Ser Tvr Gly Ile Ala Thr Val Arg Glu Val Leu Ser Asp 
070 1075 1080 





Arg Glu Leu His Leu Ser Trp Glu Val Gly Lys Pro Arg Pro Pro 


Leu 


Ser 


Gly 


Val 


Ser 


Gly 


val 


Gln 


Ala 


His 


Pro 


Glu 


val 


val 


val 


Asp 


Thr 


Thr 


Ala 


Leu 


Tyr 


Pro 


Trp 


val 


Ser 


Glu 


085 


Asn 
00 


Lys 
15 








205 


Leu 
1220 


Ala 
1235 


Ile 
1250 


Cys 
1265 


Phe 
1280 


Val 
1295 


Val 
1310 


Pro 
1325 


Leu 
1340 


Ile 
1355 


Glu 
1370 


Lys 
385 


Lys 
400 


Gln 
415 
430 


Ala 
445 


Gln 
460 





475 


Arg 


Val 


Ala 


Asp 


Pro 


Tyr 


Asn 


Pro 


Tyr 


Ala 


Asp 


Phe 


Cys 


Phe 


Asn 


Ala 


Glu 


Gly 


Ile 


Ala 


Gly 


Ile 


Lys 


Ser 


Gly 


Ala 


Asn 


Gln 


Val 


Tyr 


Thr 


Pro 


Tyr 


Pro 


Tyr 


val 


Lys 


Asp 


Thr 


Asp 


Ala 


Gln 


Pro 


Pro 


val 


His 


Val 


Gly 


Ala 


Lys 


Ser 


His 


Tyr 


Ile 


val 


Phe 


Leu 


Thr 


Gln 


Gly 


Pro 


Asp 


Cys 


Lys 


Val 


Glu 


Arg 


Leu 


Glu 


Asp 


Asp 


Lys 


Ile 


val 


val 


Ile 


Glu 


Ser 


val 


Gly 


Tyr 


val 


val 


Leu 


Lys 


Thr 


Ser 


Ala 


Ser 


Phe 


Asn 


Tle 


Leu 


Pro 


Tyr 


Met 


Thr 


Asp 


Thr 


val 


Phe 


Leu 


Tyr 


Cys 


97 


090 


Phe 
05 





Gly 
210 


Ala 
225 





Leu 
1240 


Arg 
1255 


Lys 
1270 


Ala 
1285 


Ser 
1300 


Arg 
1315 


Ala 
1330 


Phe 
1345 


Phe 
1360 


Val 
1375 


Lys 
1390 
His 
1405 


Arg 
1420 


Ile 
1435 


Gly 
1450 


Asp 
1465 


Asn 
1480 


Thr 


Tyr 


Gly 


Thr 


Gln 


Ile 


Gly 


Lys 


Arg 


Cys 


Ile 


Val 


Leu 


Met 


Ala 


Pro 


Asn 


Leu 


Ser 


Ser 


Asp 


Glu 


Ser 


Leu 


Tyr 


val 


Gly 


Thr 


Thr 


Ser 


Glu 


Ser 


Met 


Ser 


Ile 


Glu 


Ile 


Asn 


Pro 


Ala 


Lys 


Arg 


Ser 


Gly 


Ala 


Ala 


val 


Phe 


Pro 


Pro 


Val 


Asn 


Tyr 


Phe 


Thr 


His 


His 


Asp 


Gln 


His 


Val 


Lys 


Pro 


Ser 


Glu 


Thr 


His 


Thr 


Val 


Thr 


Leu 


Gln 


Ser 


Leu 


Tyr 


Thr 


Ile 


Arg 
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Arg 


Glu 


Thr 


Thr 


Tyr 


Glu 


Lys 


Phe 


Tyr 


Ala 


Ala 


Thr 


Thr 


Asn 


Tyr 


Leu 


Cys 


Cys 


Val 


Cys 


Ser 


Thr 


Asn 


Gln 


Phe 


Phe 
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1095 


Val 
1110 


Lys 
1125 


Tyr 
1140 


Val 
1155 


Val 
1170 


Phe 
1185 


Tyr 
1200 


Ala 
1215 


Thr 
1230 


Leu 
1245 


Arg 
1260 


Leu 
1275 


Thr 
1290 


Tyr 
1305 


val 
1320 


Leu 
1335 


Arg 
1350 


Arg 
1365 


Tyr 
1380 


Phe 
1395 


Ala 
1410 


Arg 
1425 


Ser 
1440 


Thr 
1455 


Thr 
1470 


Asn 
1485 


Thr 


Gly 


Lys 


Met 


Ser 


Ser 


Ile 


Ala 


Lys 


Ala 


Glu 


Ala 


Asp 


Tyr 


Thr 


Leu 


Asp 


Lys 


Ile 


Asn 


Gln 


Val 


Gln 


Val 


Lys 


Asp 


Leu 


Pro 


Ile 


Ser 


Thr 


Gly 


Cys 


Tyr 


Arg 


Gln 


Asp 


Leu 


Ile 


Lys 


Met 


Cys 


Asn 


Met 


Asn 


Pro 


Asn 


Asp 


Thr 


Ala 


Asn 


Tyr 


Asn 


Leu 


Thr 


Asn 


Leu 


Leu 


Ser 


Leu 


Val 


Tyr 


Tle 


Ser 


Gly 


Gly 


Lys 


Pro 


Lys 


Phe 


Arg 


Ala 


Ala 


Ser 


Thr 


Ile 


98 


Thr 


Leu 


Asn 


Asp 


Thr 


Val 


Ile 


Pro 


Ala 


Ala 


Val 


Asn 


Gln 


Asn 


Lys 


Phe 


Arg 


ser 


Tyr 


Thr 


Gly 


Leu 


Glu 


Arg 


Lys 


Cys 


Arg 
490 


Tyr 
505 


val 
520 


Cys 
535 
His 
550 


Asp 
565 


Ser 
580 


Asn 
595 





Trp 
1610 


Val 
1625 


Asn 
1640 


Thr 
1655 


Phe 
1670 


Val 
1685 


Gly 
1700 


Glu 
1715 


Thr 
1730 


Ser 
1745 


val 
1760 


Gly 
775 


Asn 
790 


Ala 

805 
Tyr 
820 


Lys 
835 


Phe 
850 


val 
865 





Ala 


Asp 


Ala 


Ser 


Leu 


Ile 


Met 


Met 


Tle 


Gly 


Leu 


Glu 


Lys 


val 


Leu 


Leu 


Cys 


Asp 


Tyr 


Asn 


Ala 


val 


Pro 


Val 


Pro 


Pro 


Lys 


Lys 


Thr 


Lys 


Ser 


Pro 


Met 


Phe 


Gly 


Thr 


val 


Phe 


His 


Arg 


Ser 


Thr 


Cys 


Thr 


Asn 


Leu 


His 


His 


Tle 


Glin 


Val 


Gln 


Ile 


Leu 


Leu 


Ile 


val 


Gly 


Gly 


Ile 


Phe 


Asn 


Ala 


Thr 


Leu 


Ile 


Asp 


Ser 


Leu 


Tyr 


Pro 


Gln 


val 


Glu 


Ile 


His 


Leu 


Ala 


Gly 





Asp 


Ile 


Phe 


Thr 


Asp 


Leu 


Val 


Arg 


Ile 


Lys 


Arg 


Met 


Cys 


Ala 


Phe 


Ser 


Ala 


Cys 


Gly 


Met 


His 


Glu 


99 


Ile 
495 


Phe 
510 


Ala 
525 


Thr 
540 


Ile 
555 


Pro 
570 


Lys 
585 


Arg 
600 





Val 
1615 


Pro 
1630 


Pro 
1645 


Val 
1660 


Pro 
1675 


Ile 
1690 


Val 
1705 


Lys 
1720 


Asp 
1735 


Cys 
1750 


Met 
1765 


Asn 
1780 


Ser 
1795 


Phe 
1810 


Asp 
1825 


Val 
1840 


Asp 
1855 


val 
1870 


Leu 


Thr 


Glu 


Gly 


Lys 


Lys 


Met 


Glu 


Glu 


Leu 


Thr 


Asn 


Leu 


Val 


Val 


Tyr 


Lys 


Trp 


Ile 


His 


Cys 


val 


Glu 


Val 


Ile 


Glu 


Cys 


Ser 


Asn 


Leu 


Phe 


Asp 


Asn 


Glu 


Gly 


Gln 


Gly 


Ala 


Met 


Gln 


Phe 


Phe 


Arg 


Asn 


Asp 


Asp 


Asp 


Lys 


Leu 


Lys 


Gly 


Trp 


Ile 


Leu 


Val 


His 


Lys 


Met 


Tyr 


Ala 


Cys 


Leu 


Tyr 


Lys 


Tyr 


Met 


Val 


Val 


Ala 


His 


Val 


Gln 


Ala 


Arg 


Arg 


Ser 


Asn 


Lys 
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Met 


Glu 


Thr 


Pro 


Thr 


Thr 


Gln 


Ile 


His 


Gly 


Val 


Pro 


Lys 


Leu 


Leu 


Lys 


Thr 


Ser 


Gln 


His 


Ile 


Val 


Val 


Ala 


Pro 


Phe 
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Ser 
1500 


Ile 
1515 


Gly 
1530 


Thr 
1545 


Glu 
1560 


Tyr 
1575 


Val 
1590 


Arg 
1605 


Ala 
1620 


Phe 
1635 


Asp 
1650 


Pro 
1665 


Gly 
1680 


Ser 
1695 


Trp 
1710 


Tle 
1725 


Cys 
1740 


Val 
1755 


Gln 
1770 


Cys 
1785 


Met 
1800 


Asp 
1815 


Asn 
1830 


Leu 
1845 


Lys 
1860 


Tyr 
1875 


Asp Arg Asp 


Pro 


Leu 


Gln 


Gly 


Asn 


His 


Thr 


Ser 


Thr 


Pro 


Leu 


Asp 


Ala 


Gly 


Phe 


Gly 


Trp 


Gln 


Thr 


Trp 


Ser 


Leu 


Ala 


Asp 


Arg 


Phe 


Ala 


Leu 


Arg 


Gly 


Val 


Arg 


Thr 


Glu 


Gly 


Pro 


Thr 


His 


Pro 


ser 


Phe 


Gly 


Val 


Arg 


Ser 


Ala 


Ala 


Ile 


Ala 


Arg 


Lys 


Pro 


Cys 


Leu 


Tyr 


Arg 


Asp 


Gly 


Asn 


Asp 


Trp 


Leu 


Gly 


Glu 


Thr 


Asp 


Phe 


His 


Cys 


Val 


Cys 


Asp 


Lys 


Gln 


100 


Pro 


Tyr 


Trp 


Arg 


Asp 


Ala 


Phe 


val 


Thr 


Glu 


Gly 


Trp 


Asn 


Pro 


Ile 


val 


Glu 


Thr 


Thr 


Glu 


Glu 


Ile 


Ala 


Lys 


Glin 


Phe 


Met 


Cys 
880 


Ala 
895 


Asn 
910 


Phe 
925 


Gly 
940 


Phe 
955 


Tyr 
970 


Ser 
985 





Arg 
2000 


Tyr 
2015 


Phe 
2030 


Asn 
2045 


Val 
2060 


Val 
2075 


Asp 
2090 


Ala 
2105 


Ile 
2120 


val 
2135 


Ile 
2150 


Ser 
2165 


Gly 
2180 


Thr 
2195 


Gln 
2210 


Thr 
2225 


Leu 
2240 


Lys 
2255 


Asp 


Ser 


Thr 


Cys 


Asp 


Gly 


Asp 


Tyr 


Asp 


Cys 


Arg 


Ser 


Thr 


val 


Ser 


val 


Phe 


Lys 


Tle 


Gly 


Ala 


Gln 


Glu 


Ala 


Gln 


Pro 


Pro 


Glu 


Asp 


His 


Asn 


Thr 


Ser 


Lys 


Ser 


Tle 


Asn 


Gln 


Leu 


Phe 


Asn 


Tle 


Glu 


Glu 


Ile 


Trp 


Val 


Cys 


val 


Gly 


Ser 


Phe 


Glu 


Arg 


Phe 


Lys 


His 


val 


Arg 


Leu 


Ser 


Asp 


Asp 


Leu 


Tyr 


Trp 


Thr 


Lys 


Ile 


Ile 


Leu 


Leu 


Asp 


Cys 


Ser 


Asp 


Ser 


val 


Asn 


Thr 


Ser 


Ile 


101 


Ala 


Asp 


Asp 


Val 


Tyr 


Ala 


Ser 


Tyr 


Gly 


Leu 


Ile 


Arg 


Gly 


Asn 


Phe 


Trp 


Asn 


Tyr 


Thr 


Ser 


Leu 


val 


Asn 


Tyr 


Tyr 


Gln 


Gln 


Tyr 
885 


Lys 
900 
915 


Leu 
930 


val 
945 


Phe 
960 


Pro 
975 


Val 
990 





Gly 
2005 


Asp 
2020 


Tyr 
2035 


Leu 
2050 


His 
2065 


Asn 
2080 


Glu 
2095 


Ala 
2110 


Asn 
2125 


Lys 
2140 


Met 
2155 


Leu 
2170 


Phe 
2185 


Lys 
2200 


Gly 
2215 


Phe 
2230 


Phe 
2245 


Met 
2260 


Arg 


Lys 


Phe 


Tyr 


Ser 


Asn 


Thr 


Cys 


Pro 


Ala 


Ala 


Lys 


Gln 


Phe 


Ala 


Asn 


Lys 


Leu 


Arg 


Thr 


Thr 


Arg 


Gly 


val 


Lys 


Thr 


Glu 


Tyr 


Ile 


Thr 


Pro 


Asn 


Lys 


Asn 


Glu 


Leu 


Val 


Tyr 


Gln 


Ser 


Asp 


val 


Lys 


Arg 


Gly 


Glu 


Asp 


val 


Asn 


Leu 


Thr 


Lys 


Gln 


Thr 


Lys 


Glu 


Asp 


Ala 


Leu 


His 


Leu 


Ser 


Lys 


Cys 


Asn 


Phe 


Leu 


Gly 


Tyr 


Thr 


Asn 


Val 


Ala 


Ile 


Leu 


Ala 


Thr 


Leu 


Val 


Ser 


Asp 


Leu 


US 7,776,521 BI 


Glu 


Gly 


Asn 


Asn 


Ala 


Lys 


His 


Ser 


Arg 


Met 


Asp 


Glu 


His 


Thr 


Thr 


Ile 


Asp 


Pro 


Ala 


Phe 


Arg 


Pro 


Ile 


Asp 


Arg 


Phe 


Glu 


-continued 


Leu 
1890 


Val 
1905 


Ala 
1920 


Leu 
1935 


Phe 
1950 


Gln 
1965 


Gly 
1980 


Ala 
1995 


His 
2010 


Met 
2025 


Thr 
2040 


Asn 
2055 


Ala 
2070 


Lys 
2085 


Leu 
2100 


Lys 
2115 


Tle 
2130 


Ala 
2145 


Lys 
2160 


Asp 
2175 


Asn 
2190 


Ser 
2205 


Gly 
2220 


Gly 
2235 


Asp 
2250 


Leu 
2265 


Gly 


Phe 


Cys 


Ile 


Pro 


His 


Leu 


Lys 


Thr 


His 


Ile 


Tyr 


Val 


Gly 


Val 


Pro 


Pro 


Ala 


His 


Lys 


Gly 


Gly 


Lys 


Glu 


Ile 


Leu 


Glu 


Tyr 


Tyr 


Leu 


Val 


Gly 


Thr 


Pro 


Gln 


Cys 


Ala 


Ser 


Asn 


Ala 


Glu 


Asp 


Val 


Val 


Ala 


Val 


Pro 


Arg 


Val 


Gly 


Ser 


Ile 


Glu 


Leu 


Ala 


Ser 


Phe 


Cys 


Cys 


Pro 


Phe 


Val 


Ile 


Asn 


Ala 


Leu 


Tyr 


Ala 


Gly 


Asn 


Pro 


Asn 


ser 


Thr 


Val 


Leu 


Pro 


Val 


Gln 


Asp 


Ala 


Phe 


102 


Glu 


Leu 


Leu 


Tyr 


Ser 


Ser 


Asp 


Val 


Pro 


Leu 


Pro 


Gln 


val 


Thr 


Asp 


Ile 


Ile 


Glu 


Ile 


Thr 


Phe 


Ser 


Glu 


Arg 


Met 


Leu 


2270 


His 
2285 
His 
2300 


Lys 
2315 





Phe 
2330 


Val 
2345 


Gln 
2360 


Tyr 
2375 


Glu 
2390 


Gly 
2405 


Glu 
2420 


Lys 
2435 


Tyr 
2450 


Ile 
2465 


Ala 
2480 


Ser 
2495 


Gly 
2510 


Ile 
2525 


Asn 
2540 


Lys 
2555 


Glu 
2570 


Ser 
2585 


Glu 
2600 


Gln 
2615 


Asn 
2630 


Ser 
2645 


Lys 
2660 


Ile 


Leu 


Leu 


Ile 


Ile 


Asp 


Ala 


Thr 


val 


Lys 


Gly 


Leu 


His 


val 


Asp 


Asp 


ser 


Asp 


Gln 


His 


Trp 


Ala 


Ile 


Thr 


Lys 


Glu 


Val 


Met 


Glu 


Thr 


Asp 


Leu 


Glu 


Phe 


Ala 


Cys 


Tle 


Asn 


Phe 


Leu 


Leu 


Cys 


Asp 


ser 


Lys 


ser 


Trp 


Phe 


Asp 


Asn 


Phe 


Asn 


Tyr 


Ile 


Asp 


Asp 


Leu 


Ser 


Ile 


Tyr 


Met 


Asp 


Met 


Thr 


Gly 


Arg 


Asn 


Ala 


Met 


Lys 


Leu 


Trp 


Thr 


Leu 


Gly 


Pro 


Pro 


Gln 


103 


Gly 


Gly 


Phe 


Ala 


Leu 


Val 


Ser 


Pro 


Pro 


Leu 


Met 


Leu 


Ala 


Gln 


Asp 


Thr 


Tyr 


Glu 


Ala 


Asn 


Ala 


Ile 


Tyr 


Ile 


Leu 


Ile 


2275 


Asp 
2290 


Leu 
2305 


Ile 
2320 


Gln 
2335 


Leu 
2350 


Ile 
2365 


Phe 
2380 


Lys 
2395 


Asn 
2410 


Gln 
2425 


Asn 
2440 


Thr 
2455 


Gly 
2470 


Trp 
2485 


Phe 
2500 


Val 
2515 


Asp 
2530 


Gly 
2545 


Leu 
2560 


Ala 
2575 


Phe 
2590 


Gly 
2605 


Thr 
2620 


Gln 
2635 


Lys 
2650 


Asn 
2665 


Phe 


Ala 


Pro 


Thr 


Asp 


Ser 


Met 


Leu 


Leu 


Asn 


Val 


Leu 


Ser 


Leu 


Val 


His 


Pro 


Phe 


Gly 


Asp 


val 


Ala 


Met 


Leu 


Leu 


Asp 


Ser 


Lys 


Met 


Gly 


Asp 


Lys 


Leu 


Gln 


Tyr 


Tyr 


Ala 


Ala 


Asp 


Pro 


Ser 


Thr 


Arg 


Phe 


Gly 


Leu 


Thr 


Asn 


His 


Ser 


Arg 


Met 


His 


Arg 


Asp 


Ser 


Phe 


Val 


Trp 


Ala 


Lys 


Gly 


Lys 


Val 


Lys 


Thr 


Asp 


Ala 


Thr 


Thr 


Ser 


Tyr 


Asn 


Tyr 


Ala 


Ser 


Gly 


Ile 


US 7,776,521 BI 


Gly 


Ser 


Ser 


Ser 


Val 


Val 


Cys 


Ser 


Met 


Glu 


Tyr 


Pro 


Gly 


Gly 


Ala 


Asn 


Lys 


Tyr 


Ile 


Lys 


Val 


Leu 


Asn 


Tyr 


Thr 


Tyr 


-continued 


2280 


Gln 
2295 


Gln 
2310 


Thr 
2325 


Lys 
2340 


Glu 
2355 


Lys 
2370 


Lys 
2385 


Gln 
2400 


Glin 
2415 


Asn 
2430 


Thr 
2445 


Tyr 
2460 


val 
2475 


Thr 
2490 


Asp 
2505 


Lys 
2520 
His 
2535 


Leu 
2550 


Ala 
2565 


Leu 
2580 


Asn 
2595 


Gly 
2610 


Tyr 
2625 


Ser 
2640 


Ala 
2655 


Ser 
2670 


Leu 


Asp 


Val 


Cys 


Ile 


Val 


Asp 


Ala 


Arg 


Ala 


Gln 


Asn 


Ala 


Leu 


Ser 


Trp 


Val 


Cys 


Val 


Met 


Ala 


Lys 


Ile 


Leu 


Val 


Leu 


Gly 


Ser 


Lys 


Val 


Ile 


Thr 


Gly 


Trp 


Met 


val 


Leu 


Met 


Pro 


Leu 


Thr 


Asp 


Thr 


Gly 


Lys 


Gly 


Ser 


Pro 


Phe 


Phe 


Met 


Leu 


Gly 


Pro 


Asn 


Cys 


Lys 


Ile 


His 


Glin 


Leu 


Ile 


Cys 


Arg 


Gly 


Val 


Leu 


Leu 


Lys 


Phe 


Ile 


His 


Ser 


Lys 


Trp 


Asp 


Ser 


Glu 


104 


105 


US 7,776,521 BI 


-continued 


Lys Gly Arg Leu Ile Ile Arg Glu Asn Asn Arg Val Val Val Ser 
2685 


Ser 


<210> 
<211> 
<212> 
<213> 


<400> 


2675 


Asp 


2690 


268 


0 


Ile Leu Val Asn Asn 


PRT 


SEQUENCE : 


Met Phe Ile Phe 


1 


Asp 


His 


Ser 


Asn 


65 


Ile 


val 


Ser 


Asn 


Gly 


145 


Phe 


Gly 


Phe 


Leu 


Gly 


225 


Ala 


Leu 


Thr 


Ser 


Phe 


305 


Asn 


Val 


Arg 


Thr 


Asp 


50 


Val 


Pro 


Val 


Val 


Phe 


130 


Thr 


Glu 


Asn 


Leu 


Pro 


210 


Ile 


Gln 


Lys 


Asp 


val 


290 


Arg 


Leu 


Tyr 


Cys 


Ser 


35 


Thr 


Thr 


Phe 


Arg 


Ile 


115 


Glu 


Gln 


Tyr 


Phe 


Tyr 


195 


Ser 


Asn 


Asp 


Pro 


Ala 


275 


Lys 


val 


Cys 


Ala 


Thr 


20 


Ser 


Leu 


Gly 


Lys 


Gly 


100 


Ile 


Leu 


Thr 


Ile 


Lys 


180 


Val 


Gly 


Ile 


Ile 


Thr 


260 


Val 


Ser 


Val 


Pro 


Trp 


SEQ ID NO 4 
LENGTH: 
TYPE: 
ORGANISM: Coronavirus 


1255 


4 


Leu 


5 


Thr 


Met 


Tyr 


Phe 


Asp 


85 


Trp 


Ile 


Cys 


His 


Ser 


165 


His 


Tyr 


Phe 


Thr 


Trp 


245 


Thr 


Asp 


Phe 


Pro 


Phe 


325 


Glu 


Leu 


Phe 


Arg 


Leu 


His 


70 


Gly 


val 


Asn 


Asp 


Thr 


150 


Asp 


Leu 


Lys 


Asn 


Asn 


230 


Gly 


Phe 


Cys 


Glu 


Ser 


310 


Gly 


Arg 


269 


Phe 


Asp 


Gly 


Thr 


55 


Thr 


Ile 


Phe 


Asn 


Asn 


135 


Met 


Ala 


Arg 


Gly 


Thr 


215 


Phe 


Thr 


Met 


Ser 


Ile 


295 


Gly 


Glu 


Lys 


5 


Leu 


Asp 


Val 


40 


Gln 


Ile 


Tyr 


Gly 


Ser 


120 


Pro 


Ile 


Phe 


Glu 


Tyr 


200 


Leu 


Arg 


Ser 


Leu 


Gln 


280 


Asp 


Asp 


val 


Lys 


Thr 


Val 


25 


Tyr 


Asp 


Asn 


Phe 


Ser 


105 


Thr 


Phe 


Phe 


Ser 


Phe 


185 


Gln 


Lys 


Ala 


Ala 


Lys 


265 


Asn 


Lys 


val 


Phe 


Ile 


Leu 


10 


Gln 


Tyr 


Leu 


His 


Ala 


90 


Thr 


Asn 


Phe 


Asp 


Leu 


170 


Val 


Pro 


Pro 


Ile 


Ala 


250 


Tyr 


Pro 


Gly 


Val 


Asn 


330 


Ser 


Thr 


Ala 


Pro 


Phe 


Thr 


75 


Ala 


Met 


Val 


Ala 


Asn 


155 


Asp 


Phe 


Ile 


Ile 


Leu 


235 


Ala 


Asp 


Leu 


Ile 


Arg 


315 


Ala 


Asn 


Ser 


Pro 


Asp 


Leu 


60 


Phe 


Thr 


Asn 


val 


val 


140 


Ala 


Val 


Lvs 


Asp 


Phe 


220 


Thr 


Tvr 


Glu 


Ala 


Tvr 


300 


Phe 


Thr 


Cvs 


Gly 


Asn 


Glu 


45 


Pro 


Gly 


Glu 


Asn 


Tle 


125 


Ser 


Phe 


Ser 


Asn 


Val 


205 


Lys 


Ala 


Phe 


Asn 


Glu 


285 


Gln 


Pro 


Lys 


Val 


Ser 


Tyr 


30 


Ile 


Phe 


Asn 


Lys 


Lys 


110 


Arg 


Lys 


Asn 


Glu 


Lys 


190 


Val 


Leu 


Phe 


Val 


Gly 


270 


Leu 


Thr 


Asn 


Phe 


Ala 


Asp 


15 


Thr 


Phe 


Tyr 


Pro 


Ser 


95 


Ser 


Ala 


Pro 


Cys 


Lys 


175 


Asp 


Arg 


Pro 


Ser 


Gly 


255 


Thr 


Lys 


Ser 


Ile 


Pro 


335 


Asp 


Leu 


Gln 


Ser 


Val 


80 


Asn 


Gln 


Cys 


Met 


Thr 


160 


Ser 


Gly 


Asp 


Leu 


Pro 
240 


Tyr 


Ile 


Cys 


Asn 


Thr 


320 


Ser 


Tyr 


106 


Ser 


Val 


Asp 


385 


Gln 


Met 


Thr 


Arg 


Lys 


465 


Tyr 


val 


Pro 


Phe 


Phe 


545 


Ser 


Ser 


Glu 


Ala 


Gly 


625 


His 


Cys 


ser 


Tyr 


Thr 


705 


Asn 


Glin 


Ala 


val 


Ser 


370 


Ser 


Thr 


Gly 


Gly 


Pro 


450 


Pro 


Gly 


Val 


Lys 


Asn 


530 


Gln 


Val 


Phe 


Val 


Ile 


610 


Asn 


val 


Ala 


Ile 


Ser 


690 


Thr 


Met 


Tyr 


Ala 


Leu 


355 


Ala 


Phe 


Gly 


Cys 


Asn 


435 


Phe 


Cys 


Phe 


Leu 


Leu 


515 


Gly 


Pro 


Arg 


Gly 


Ala 


595 


His 


Asn 


Asp 


Ser 


val 


675 


Asn 


Glu 


Tyr 


Gly 


Glu 
755 


340 


Tyr 


Thr 


val 


val 


val 


420 


Tyr 


Glu 


Thr 


Tyr 


Ser 


500 


Ser 


Leu 


Phe 


Asp 


Gly 


580 


Val 


Ala 


val 


Thr 


Tyr 


660 


Ala 


Asn 


val 


Ile 


Ser 


740 


Gln 


Asn 


Lys 


Val 


Ile 


405 


Leu 


Asn 


Arg 


Pro 


Thr 


485 


Phe 


Thr 


Thr 


Gln 


Pro 


565 


val 


Leu 


Asp 


Phe 


Ser 


645 


His 


Tyr 


Thr 


Met 


Cys 


725 


Phe 


Asp 


Ser 


Leu 


Lys 


390 


Ala 


Ala 


Tyr 


Asp 


Pro 


470 


Thr 


Glu 


Asp 


Gly 


Gln 


550 


Lys 


Ser 


Tyr 


Gln 


Gln 


630 


Tyr 


Thr 


Thr 


Tle 


Pro 


710 


Gly 


Cys 


Arg 


107 


Thr 
Asn 
375 
Gly 
Asp 
Trp 
Lys 
Ile 
455 
Ala 
Thr 
Leu 
Leu 
Thr 
535 
Phe 
Thr 
val 
Gln 
Leu 
615 
Thr 
Glu 
Val 
Met 
Ala 
695 
Val 
Asp 


Thr 


Asn 


Phe 


360 


Asp 


Asp 


Tyr 


Asn 


Tyr 


440 


Ser 


Leu 


Gly 


Leu 


Ile 


520 


Gly 


Gly 


Ser 


Ile 


Asp 


600 


Thr 


Gln 


Cys 


Ser 


Ser 


680 


Ile 


Ser 


Ser 


Gln 


Thr 
760 


345 


Phe 


Leu 


Asp 


Asn 


Thr 


425 


Arg 


Asn 


Asn 


Ile 


Asn 


505 


Lys 


Val 


Arg 


Glu 


Thr 


585 


Val 


Pro 


Ala 


Asp 


Leu 


665 


Leu 


Pro 


Met 


Thr 


Leu 


745 


Arg 


Ser 


Cys 


Val 


Tyr 


410 


Arg 


Tyr 


Val 


Cys 


Gly 


490 


Ala 


Asn 


Leu 


Asp 


Ile 


570 


Pro 


Asn 


Ala 


Gly 


Ile 


650 


Leu 


Gly 


Thr 


Ala 


Glu 


730 


Asn 


Glu 


Thr 


Phe 


Arg 


395 


Lys 


Asn 


Leu 


Pro 


Tyr 


475 


Tyr 


Pro 


Gln 


Thr 


Val 


555 


Leu 


Gly 


Cys 


Trp 


Cys 


635 


Pro 


Arg 


Ala 


Asn 


Lys 


416 


Cys 


Arg 


Val 
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-continued 


Phe 


Ser 


380 


Gln 


Leu 


Ile 


Arg 


Phe 
460 


Trp 


Gln 


Ala 


Cys 


Pro 


540 


Ser 


Asp 


Thr 


Thr 


Arg 


620 


Leu 


Ile 


Ser 


Asp 


Phe 


700 


Thr 


Ala 


Ala 


Phe 


Lys 


365 


Asn 


Tle 


Pro 


Asp 


His 


445 


Ser 


Pro 


Pro 


Thr 


Val 


525 


Ser 


Asp 


Ile 


Asn 


Asp 


605 


Ile 


Ile 


Gly 


Thr 


Ser 


685 


Ser 


Ser 


Asn 


Leu 


Ala 
765 


350 


Cys 


Val 


Ala 


Asp 


Ala 


430 


Gly 


Pro 


Leu 


Tyr 


val 


510 


Asn 


Ser 


Phe 


Ser 


Ala 


590 


Val 


Tyr 


Gly 


Ala 


Ser 


670 


Ser 


Ile 


Val 


Leu 


Ser 


750 


Glin 


Tyr 


Tyr 


Pro 


Asp 


415 


Thr 


Lys 


Asp 


Asn 


Arg 


495 


Cys 


Phe 


Lys 


Thr 


Pro 


575 


Ser 


Ser 


Ser 


Ala 


Gly 


655 


Glin 


Ile 


Ser 


Asp 


Leu 


735 


Gly 


Val 


Gly 


Ala 


Gly 


400 


Phe 


Ser 


Leu 


Gly 


Asp 


480 


Val 


Gly 


Asn 


Arg 


Asp 


560 


Cys 


Ser 


Thr 


Thr 


Glu 


640 


Ile 


Lys 


Ala 


Ile 


Cys 
720 
Leu 


Ile 


Lys 


108 


Glin 


Ser 


785 


Glu 


Lys 


Cys 


Asp 


Thr 


865 


Ala 


val 


Ile 


Lys 


val 


945 


Asp 


Arg 


Gln 


Thr 


Phe 


Pro 


Glu 


Ala 


Trp 


Thr 


Ile 


Ser 


Pro 


Val 


Met 
770 


Gln 


Asp 


Gln 


Ala 


Asp 


850 


Ala 


Met 


Leu 


Ser 


Leu 


930 


Lys 


Ile 


Leu 


Leu 


Lys 


1010 


Cys 
1025 


His 
1040 


Arg 
1055 


Tyr 
070 


Phe 
085 


Asp 
00 








Tyr 


Ile 


Leu 


Tyr 


Gln 


835 


Met 


Gly 


Gln 


Tyr 


Gln 


915 


Gln 


Gln 


Leu 


Ile 


Ile 


995 


Met 


Gly 


Gly 


Asn 


Phe 


Tle 


Asn 


Asn 


Lys 


Val 


Tle 


Lys 


Leu 


Leu 


Gly 


820 


Lys 


Thr 


109 


Pro Thr Leu Lys 


775 


Pro Asp Pro Leu Lys 


790 


Phe Asn Lys Val Thr 


805 


Glu 


Cys Leu Gly Asp 


825 


Phe Asn Gly Leu Thr 
840 


Ile Ala 


Trp 


Thr 


Met Ala 


885 


Glu Asn 


900 


Ile 


Asp 


Leu 


Gln 


val 


Ser 


Ser Arg 


Thr 
980 


Arg 


965 


Gly 


Ala 


Ala Tyr Thr Ala 


855 


Phe Gly Ala Gly 


870 


Tyr Arg Phe Asn 


Gln Lys Gln Ile 


905 


Glu Ser Leu Thr 
920 


Val Asn Gln Asn 


935 


Ser Asn Phe Gly 


950 


Leu Asp Lys Val 


Arg Leu Gln Ser 


Ala Glu Ile 


985 


1000 


Ser Glu Cys Val 


Lys 


val 


Phe 


Pro 


Thr 


Thr 


Asn 


Glu 


Asp 


Glin 


Gly 


val 


Thr 


Arg 


Gln 


Phe 


Thr 


Glu 


Leu 


Lys 


1015 


Tyr His 
1030 


Phe Leu 
1045 


Thr Ala 
1060 


Glu Gly 
1075 


Arg Asn 
1090 


Val Ser 
1105 


Val Tyr 
1120 


Leu Asp 
1135 


Gly Asp 
1150 


Glu Ile 
1165 


Leu Gly Gln 


Leu Met 


His Val 


Pro Ala 


Val Phe 


Phe Phe 


Gly Asn 


Asp Pro 


Lys Tyr 


Ile Ser 


Asp Arg 


Tyr 


Pro 


Leu 


810 


Ile 


Val 


Ala 


Ala 


Gly 


890 


Ala 


Thr 


Ala 


Ala 


Glu 


970 


Leu 


Ser 


Thr 


Ile 


Val 


Ser 


Cys 


Leu 


Phe 


Gly 


Leu 


Phe 


Thr 


795 


Ala 


Asn 


Leu 


Leu 


Ala 


875 


Ile 


Asn 


Thr 


Gln 


Ile 


955 


Ala 


Gln 


Phe 


Tyr 


Cys 


Phe 


Pro 


Asp 


Gln 


Lys 


Ile 


Asn 
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Gly 


780 


Lys 


Asp 


Ala 


Pro 


Val 


860 


Leu 


Gly 


Gln 


Ser 


Ala 


940 


Ser 


Glu 


Thr 


Gly 


Arg 


Ala 


Arg 


Pro 


845 


Ser 


Gln 


Val 


Phe 


Thr 


925 


Leu 


Ser 


val 


Tyr 


Phe 


Ser 


Gly 


Asp 


830 


Leu 


Gly 


Ile 


Thr 


Asn 


910 


Ala 


Asn 


val 


Gln 


Val 
990 


1005 


Ser Lys 


1020 


Pro 


1035 


Val 


1050 


His 


1065 


Asn 


1080 


Gln 


1095 


Val 


1110 


Pro 


1125 


Asn 


1140 


Asn 


1155 


Glu 


1170 


Arg 


Gln 


Val 


Pro 


Glu 


Gly 


Ile 


Val 


Glu 


His 


Ala 


Ala 


Leu 


Ser 


Asn 


Phe 


Phe 


815 


Leu 


Leu 


Thr 


Pro 


Gln 


895 


Lys 


Leu 


Thr 


Leu 


Ile 


975 


Thr 


Ser 


Gly 


Thr 


Ile 


Ile 


Thr 


Ala 


Phe 


Tle 


800 


Met 


Ile 


Thr 


Ala 


Phe 


880 


Asn 


Ala 


Gly 


Leu 


Asn 


960 


Asp 


Gln 


Arg Ala Ser Ala Asn Leu Ala Ala 


Val Asp 


Ala 


Gln 


Lys 


Ser 


Thr 


Gly 


Asp 


Ser 


Val 


Lys 


110 


Asn 


Glu 


Ala 


Met 


Ser 


Gly 


<210> 
<211> 
<212> 
<213> 


<400> 


Met 


1 


Pro 


Ala 


Gly 


Leu 


65 


Cys 


val 


Tyr 


Leu 


Tyr 


145 


Asn 


Thr 


Arg 


Glu 


Ile 


225 


Pro 


Pro 


Pro 


Leu 
175 


Gln 
190 


Gly 
205 
220 


Cys 
235 


val 
250 





Asp 


Val 


Thr 


Val 


50 


Asn 


Asn 


Ala 


Phe 


Cys 


130 


Phe 


Ser 


Pro 


His 


val 


210 


Glu 


Asn 


Ala 


Leu 


Asn 


Tyr 


Leu 


Ser 


Cys 


Lys 


Leu 


Lys 


Ile 


35 


Ala 


Lys 


Leu 


Ala 


Leu 


115 


Trp 


val 


val 


Lys 


Ser 


195 


Tyr 


Asn 


val 


Met 


Glu Ser 


Tle 


Ile 


Cys 


Lys 


Lys 


Ala 


Cys 


Phe 


Leu His 


PRT 


ORGANISM: 


SEQUENCE: 


Phe 


Ile 


20 


Pro 


Phe 


Arg 


Leu 


Gly 


100 


Gln 


Lys 


Cys 


Thr 


Leu 


180 


Gly 


Tyr 


Ala 


Gln 


Asp 
260 


SEQ ID NO 5 
LENGTH: 27 
TYPE: 


4 


111 


Leu 


Trp 


Ile 


Ser 


Asp 


Tyr 


Ile 
180 


Pro 
195 


Val 
210 


Cys 
225 


Glu 
240 


Thr 
255 





Coronavirus 


5 


Met 


Asp 


Leu 


Leu 


Trp 


Leu 


85 


Met 


Cys 


Cys 


Trp 


Asp 


165 


Lys 


Val 


Gln 


Thr 


Ile 


245 


Pro 


Arg 


Asn 


Asp Leu 


Trp Tyr 


Met Val 


Leu Lys 


Asp Asp 


Phe Phe Thr 


Ala Se 


Gln Ala Se 


Ala 


Gln 
70 


Leu 


40 


val 
55 


Leu Al 


Phe Va 


Glu Ala Gl 


Ile Asn Al 


Lys 


His 
150 


Thr 


Glu Asp 


Lys Asp 


Leu 


Phe 


230 


His 


Tle 


12 


T Pro 
25 


r Leu 


Phe Gln 


a Leu 


1 Thr 


n Phe 
105 


a Cys 
0 


Ser Lys Asn 


135 


Thr Hi 


Ile Va 


20 
Glu Se 
215 
Phe Il 


Thr Il 


Tyr As 


s Asn 


1 Val 


Tyr Gln 


185 


Tyr Val 


0 


r Thr 


e Phe 


e Asp 


p Glu 
265 


Leu 


10 


Ala 


Pro 


Ser 


Tyr 


Ile 


90 


Leu 


Arg 


Pro 


Tyr 


Thr 


170 


Ile 


val 


Gln 


Asn 


Gly 


250 


Pro 


Gln 


Val 


Thr 


Gly 


Ser 


Gly 


Ser 


Phe 


Ala 


Lys 


75 


Tyr 


Tyr 


Ile 


Leu 


Asp 


155 


Glu 


Gly 


Val 


Ile 


Lys 


235 


Ser 


Thr 


Glu 


Trp 


Ile 


Ala 


Glu 


US 7,776,521 BI 


-continued 


1185 


Leu 


1200 


Leu 


1215 


Cys 


1230 


Pro 


1245 


Ser 


Thr 


Gly 


Thr 


60 


Gly 


Ser 


Leu 


Tle 


Leu 


140 


Tyr 


Gly 


Gly 


His 


Thr 


220 


Leu 


Ser 


Thr 


Ile 


val 


Trp 


45 


Lys 


Phe 


His 


Tyr 


Met 


125 


Tyr 


Cys 


Asp 


Tyr 


Gly 


205 


Thr 


Val 


Gly 


Thr 


Gly 


Leu 


Ser 


Val 


Thr 


His 


30 


Leu 


Ile 


Gln 


Leu 


Ala 


110 


Arg 


Asp 


Ile 


Gly 


Ser 
190 


Tyr 


Asp 


Lys 


Val 


Thr 
270 


Leu Gly Lys 


Phe 


Cys 


Cys 


Leu 


Ala 


15 


Ala 


Val 


Ile 


Phe 


Leu 


95 


Leu 


Cys 


Ala 


Pro 


Ile 


175 


Glu 


Phe 


Thr 


Asp 


Ala 


255 


Ser 


Tyr 


Ile 


Cys 


Gly 


Lys 


Gln 


Thr 


Ile 


Ala 


Ile 


80 


Leu 


Ile 


Trp 


Asn 


Tyr 


160 


Ser 


Asp 


Thr 


Gly 


Pro 


240 


Asn 


Val 


112 


113 


<210> SEQ ID NO 6 

<211> LENGTH: 154 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 6 


Met Met Pro Thr Thr Leu Phe 
1 5 


Val Tyr His Ile Thr Val Ser 
20 


Thr Ala Phe Gln His Gln Asn 
35 


Ile Leu Arg Ile Gly Thr Gln 
50 55 


Ala Ile Ser Pro Lys Phe Thr 
65 70 


Gln Thr Leu Val Leu Lys Met 
85 


Leu Lys Thr His Arg Met Cys 
100 


Glu Leu Leu Ile Gln Gln Trp 
115 


Leu Leu Ala Cys Leu Cys Lys 
130 135 


Thr His Ser Phe Arg Lys Lys 
145 150 


<210> SEQ ID NO 7 

<211> LENGTH: 76 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 7 


Met Tyr Ser Phe Val Ser Glu 
1 5 


Val Leu Leu Phe Leu Ala Phe 
20 


Ile Leu Thr Ala Leu Arg Leu 
35 


Val Ser Leu Val Lys Pro Thr 
50 55 


Leu Asn Ser Ser Glu Gly Val 
65 70 


<210> SEO ID NO 8 

<211> LENGTH: 221 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 8 


Met Ala Asp Asn Gly Thr Ile 
L 5 


Glu Gln Trp Asn Leu Val Ile 
20 


Leu Leu Gln Phe Ala Tyr Ser 
35 


Lys Leu Val Phe Leu Trp Leu 


Ala 


Gln 


Ser 


40 


Val 


Thr 


Leu 


Lys 


Ile 


120 


His 


Gln 


Glu 


val 


Cys 


40 


val 


Pro 


Thr 


Gly 


Asn 


40 


Leu 


Gly 


Ile 


25 


Lys 


Leu 


Ser 


His 


Tyr 


105 


Gln 


Lys 


Val 


Thr 


Val 


25 


Ala 


Tyr 


Asp 


Val 
Phe 
25 


Arg 


Trp 


Thr 


10 


Gln 


Lys 


Lys 


Leu 


Ser 


90 


Thr 


Phe 


Lys 


Arg 


Gly 
10 

Phe 
Tyr 


Val 


Leu 


Glu 
10 
Leu 


Asn 


Pro 


His 


Leu 


Thr 


Thr 


Ser 


75 


Ser 


Gln 


Met 


Val 


Thr 


Leu 


Cys 


Tyr 


Leu 
75 


Glu 


Phe 


Arg 


Val 
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Ile 


Ser 


Thr 


Met 


60 


Leu 


Ser 


Ser 


Met 


Ser 
140 


Leu 


Leu 


Cvs 


Ser 


60 


Val 


Leu 


Leu 


Phe 


Thr 


Thr 


Leu 


Lvs 


45 


Ser 


His 


Leu 


Thr 


Ser 


125 


Thr 


Ile 


Val 


Asn 


45 


Arg 


Lvs 


Ala 


Leu 


45 


Leu 


Met 


Leu 


30 


Leu 


Leu 


Lvs 


Thr 


Ala 


110 


Arg 


Asn 


Val 
Thr 
30 


Ile 


Val 


Glin 


Trp 
30 


Tyr 


Ala 


Thr 


15 


Lys 


Val 


Tyr 


Leu 


Ser 


95 


Leu 


Arg 


Leu 


Asn 
15 
Leu 


Val 


Lys 


Leu 
15 
Ile 


Ile 


Cys 


Thr 


Val 


Val 


Met 


Leu 


80 


Leu 


Gln 


Arg 


Cys 


Ser 


Ala 


Asn 


Asn 


Leu 


Met 


Ile 


Phe 


114 


US 7,776,521 
115 
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50 55 60 


Val Leu Ala Ala Val Tyr Arg Ile Asn Trp Val Thr Gly Gly Ile 
65 70 75 


Ile Ala Met Ala Cys Ile Val Gly Leu Met Trp Leu Ser Tyr Phe 
85 90 95 


Ala Ser Phe Arg Leu Phe Ala Arg Thr Arg Ser Met Trp Ser Phe 
100 105 110 


Pro Glu Thr Asn Ile Leu Leu Asn Val Pro Leu Arg Gly Thr Ile 
115 120 125 


Thr Arg Pro Leu Met Glu Ser Glu Leu Val Ile Gly Ala Val Ile 
130 135 140 


Arg Gly His Leu Arg Met Ala Gly His Pro Leu Gly Arg Cys Asp 
145 150 155 


Lys Asp Leu Pro Lys Glu Ile Thr Val Ala Thr Ser Arg Thr Leu 
165 170 175 


Tyr Tyr Lys Leu Gly Ala Ser Gln Arg Val Gly Thr Asp Ser Gly 
180 185 190 


Ala Ala Tyr Asn Arg Tyr Arg Ile Gly Asn Tyr Lys Leu Asn Thr 
195 200 205 


His Ala Gly Ser Asn Asp Asn Ile Ala Leu Leu Val Gln 
210 215 220 


<210> SEQ ID NO 9 

<211> LENGTH: 63 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 9 


Met Phe His Leu Val Asp Phe Gln Val Thr Ile Ala Glu Ile Leu 
1 5 10 15 


Ile Ile Met Arg Thr Phe Arg Ile Ala Ile Trp Asn Leu Asp Val 
20 25 30 


Ile Ser Ser Ile Val Arg Gln Leu Phe Lys Pro Leu Thr Lys Lys 
35 40 45 


Tyr Ser Glu Leu Asp Asp Glu Glu Pro Met Glu Leu Asp Tyr Pro 
50 55 60 


<210> SEQ ID NO 10 

<211> LENGTH: 122 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 10 


Met Lys Ile Ile Leu Phe Leu Thr Leu Ile Val Phe Thr Ser Cys 
1 5 10 15 


Leu Tyr His Tyr Gln Glu Cys Val Arg Gly Thr Thr Val Leu Leu 
20 25 30 


Glu Pro Cys Pro Ser Gly Thr Tyr Glu Gly Asn Ser Pro Phe His 
BB 40 45 


Leu Ala Asp Asn Lvs Phe Ala Leu Thr Cvs Thr Ser Thr His Phe 
50 55 60 


Phe Ala Cys Ala Asp Gly Thr Arg His Thr Tyr Gln Leu Arg Ala 
65 70 75 


Ser Val Ser Pro Lys Leu Phe Ile Arg Gln Glu Glu Val Gln Gln 
85 90 95 


Leu Tyr Ser Pro Leu Phe Leu Ile Val Ala Ala Leu Val Phe Leu 


Bl 


Ala 
80 

Val 
Asn 
Val 
Ile 
Ile 
160 
Ser 


Phe 


Asp 


Ile 
Ile 


Asn 


Glu 
Lys 
Pro 
Ala 
80 

Glu 


Ire 


116 


100 


117 


Leu Cys Phe Thr Ile Lys Arg 


<210> 
<211> 
<212> 
<213> 


<400> 


115 


PRT 


SEQUENCE : 


Met Cys Leu Lys 


d 


Ser 


Trp 


Gln 


Glu 


65 


Lys 


<210> 
<211> 
<212> 
<213> 


<400> 


Thr 
His 
Asp 
50 


Gly 


Arg 


Ala 


Thr 


35 


Pro 


His 


Thr 


Trp 


20 


Met 


Ala 


Gln 


Asn 


ORGANISM: 


SEQUENCE : 


Met Ser Asp Asn 


1 


Thr 


Arg 


Asn 


Leu 


65 


Pro 


Gly 


Tyr 


Glu 


Asp 


145 


Glin 


ser 


Gly 


Ala 


Phe 


Asn 


Thr 


50 


Arg 


Asp 


Gly 


Leu 


Gly 


130 


His 


Leu 


Arg 


Asn 


Arg 
210 


Gly 


Gly 


35 


Ala 


Phe 


Asp 


Asp 


Gly 


215. 


Ile 


Ile 


Pro 


Gly 


Ser 


195 


Met 


Gly 


20 


Ala 


Ser 


Pro 


Gln 


Gly 


100 


Thr 


val 


Gly 


Gln 


Gly 


180 


Arg 


Ala 


84 


SEQ ID NO 11 
LENGTH: 
TYPE: 
ORGANISM: 


Coronavirus 


LL 


Ile 


Leu 


Val 


Gly 


Thr 


SEQ ID NO 12 
LENGTH: 
TYPE: PRT 


422 


Leu 


Cys 


Gln 


Gly 


Ala 
70 


Val 


Ala 


Thr 


Ala 


55 


Ala 


Coronavirus 


12 


Gly 


Pro 


Arg 


Trp 


Arg 


Ile 


85 


Lys 


Gly 


Trp 


Thr 


Gly 


165 


Ser 


Asn 


Ser 


Pro 


Thr 


Pro 


Phe 


Gly 


70 


Gly 


Met 


Pro 


Val 


Arg 


150 


Thr 


Gln 


Ser 


Gly 


Gln 


Asp 


Lys 


Thr 


55 


Gln 


Tyr 


Lys 


Glu 


Ala 


135 


Asn 


Thr 


Ala 


Thr 


Gly 
215 


105 


Lys Thr Glu 


120 


Arg 


Leu 


Cys 


40 


Leu 


Phe 


Ser 


Ser 


Gln 


40 


Ala 


Gly 


Tyr 


Glu 


Ala 


120 


Thr 


Pro 


Leu 


Ser 


Pro 


200 


Gly 


Tyr 


Gly 


25 


Thr 


Ile 


Arg 


Asn 


Thr 


25 


Arg 


Leu 


val 


Arg 


Leu 


105 


Ser 


Glu 


Asn 


Pro 


Ser 


185 


Gly 


Glu 


Asn 


10 


Lys 


Pro 


Ala 


Asp 


Gln 


10 


Asp 


Arg 


Thr 


Pro 


Arg 


90 


Ser 


Leu 


Gly 


Asn 


Lys 


170 


Arg 


Ser 


Thr 


Thr 


Val 


Asn 


Arg 


Val 
75 


Arg 


Asn 


Pro 


Gln 


Ile 


75 


Ala 


Pro 


Pro 


Ala 


Asn 


155 


Gly 


Ser 


Ser 


Ala 
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Arg 


Leu 


Val 


Cvs 


Leu 


Ser 


Asn 


Gln 


His 


60 


Asn 


Thr 


Arg 


Tyr 


Leu 


140 


Ala 


Phe 


Ser 


Arg 


Leu 
220 


Gly 


Pro 


Thr 
45 


Trp 


val 


Ala 


Glin 


Gly 


45 


Gly 


Thr 


Arg 


Trp 


Gly 


125 


Asn 


Ala 


Tyr 


Ser 


Gly 


205 


Ala 


110 


Asn 
Phe 
30 


Ile 


Tyr 


Val 


Pro 


Asn 


30 


Leu 


Lys 


Asn 


Arg 


Tyr 


TIO 


Ala 


Thr 


Thr 


Ala 


Arg 


190 


Asn 


Leu 


Thr 
15 

His 
Asn 


Leu 


Leu 


Arg 


15 


Gly 


Pro 


Glu 


Ser 


Val 


95 


Phe 


Asn 


Pro 


Val 


Glu 


175 


Ser 


Ser 


Leu 


Tyr 


Arg 


Cys 


His 


Asn 
80 


Ile 


Gly 


Asn 


Glu 


Gly 


80 


Arg 


Tyr 


Lys 


Lys 


Leu 


160 


Gly 


Arg 


Pro 


Leu 


118 
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Leu Asp Arg Leu Asn Gln Leu Glu Ser Lys Val Ser Gly 


225 


230 235 


Gln Gln Gln Gly Gln Thr Val Thr Lys Lys Ser Ala Ala 


245 250 


Lys Lys Pro Arg Gln Lys Arg Thr Ala Thr Lys Gln Tyr 


Gln A 


Asp G 


260 265 


a Phe Gly Arg Arg Gly Pro Glu Gln Thr Gln Gly 
275 280 285 


n Asp Leu Ile Arg Gln Gly Thr Asp Tyr Lys His 


290 295 300 


Ile A 
305 


Ile G 


Ala I 





a Gln Phe Ala Pro Ser Ala Ser Ala Phe Phe Gly 
310 315 


y Met Glu Val Thr Pro Ser Gly Thr Trp Leu Thr 


325 330 


e Lys Leu Asp Asp Lys Asp Pro Gln Phe Lys Asp 
340 345 


Leu Leu Asn Lys His Ile Asp Ala Tyr Lys Thr Phe Pro 


355 360 365 


Pro Lys Lys Asp Lys Lys Lys Lys Thr Asp Glu Ala Gln 
370 375 380 


Gln Arg Gln Lys Lys Gln Pro Thr Val Thr Leu Leu Pro 


385 


390 395 


Met Asp Asp Phe Ser Arg Gln Leu Gln Asn Ser Met Ser 


405 410 


Ala Asp Ser Thr Gln Ala 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


<400> 


420 


SEQ ID NO 13 

LENGTH: 21 

TYPE: DNA 

ORGANISM: Artificial Sequence 

FEATURE: 

OTHER INFORMATION: Synthetic oligonucleotide. 


SEQUENCE: 13 


ctaacatgct taggataatg g 


<210> 
<211> 
<212> 
<213> 
<220> 
«223» 


«400» 


SEQ ID NO 14 

LENGTH: 21 

TYPE: DNA 

ORGANISM: Artificial Sequence 

FEATURE: 

OTHER INFORMATION: Synthetic oligonucleotide. 


SEQUENCE: 14 


gectctcttg ttettgeteg c 


«210» 
«211» 
«212» 
«213» 
«220» 
«223» 


«400» 


SEO ID NO 15 


LENGTH: 21 

TYPE: DNA 

ORGANISM: Artificial Sequence 

FEATURE: 

OTHER INFORMATION: Synthetic oligonucleotide. 


SEQUENCE: 15 


caggtaagcg taaaactcat c 


«210» 
«211» 


SEQ ID NO 16 
LENGTH: 22 


Lys 


Glu 


Asn 


270 


Asn 


Trp 


Met 


Tyr 


Asn 


350 


Pro 


Pro 


Ala 


Gly 


Gly 


Ala 


255 


Val 


Phe 


Pro 


Ser 


His 


335 


Val 


Thr 


Leu 


Ala 


Ala 
415 


Gln 


240 


Ser 


Thr 


Gly 


Gln 


Arg 


320 


Gly 


Ile 


Glu 


Pro 


Asp 


400 


Ser 


21 


21 


21 


120 


US 7,776,521 BI 
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<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 16 


catgtgtggc ggetcactat at 22 


<210> SEQ ID NO 17 

<211> LENGTH: 28 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 17 


gacactatta gcataagcag ttgtagca 28 


<210> SEQ ID NO 18 

<211> LENGTH: 27 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 18 


ttaaaccagg tggaacatca tccggtg 27 


«210» SEQ ID NO 19 

«211» LENGTH: 22 

«212» TYPE: DNA 

«213» ORGANISM: Artificial Sequence 

«220» FEATURE: 

«223» OTHER INFORMATION: Synthetic oligonucleotide. 


«400» SEQUENCE: 19 


ggagccttga atacacccaa ag 22 


«210» SEQ ID NO 20 

«211» LENGTH: 17 

«212» TYPE: DNA 

«213» ORGANISM: Artificial Sequence 

«220» FEATURE: 

«223» OTHER INFORMATION: Synthetic oligonucleotide. 


«400» SEQUENCE: 20 


gcacggtggc agcattg 17 


«210» SEQ ID NO 21 

«211» LENGTH: 21 

«212» TYPE: DNA 

«213» ORGANISM: Artificial Sequence 

«220» FEATURE: 

«223» OTHER INFORMATION: Synthetic oligonucleotide. 


«400» SEQUENCE: 21 


ccacattgge acccgcaatc c 21 


<210> SEQ ID NO 22 

<211> LENGTH: 19 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 22 


US 7,776,521 BI 
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caaacattgg ccgcaaatt 19 


<210> SEQ ID NO 23 

<211> LENGTH: 21 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 23 


caatgcgtga cattccaaag a 21 


<210> SEQ ID NO 24 

<211> LENGTH: 26 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 24 


cacaatttgc tccaagtgcc tctgca 26 


«210» SEQ ID NO 25 

«211» LENGTH: 21 

«212» TYPE: DNA 

«213» ORGANISM: Artificial Sequence 

«220» FEATURE: 

«223» OTHER INFORMATION: Synthetic oligonucleotide. 


«400» SEQUENCE: 25 


gaagtaccat ctggggctga g 21 


«210» SEQ ID NO 26 

«211» LENGTH: 18 

«212» TYPE: DNA 

«213» ORGANISM: Artificial Sequence 

«220» FEATURE: 

«223» OTHER INFORMATION: Synthetic oligonucleotide. 


«400» SEQUENCE: 26 


ccgaagagct acccgacg 18 


«210» SEQ ID NO 27 

«211» LENGTH: 26 

«212» TYPE: DNA 

«213» ORGANISM: Artificial Sequence 

«220» FEATURE: 

«223» OTHER INFORMATION: Synthetic oligonucleotide. 


«400» SEQUENCE: 27 


etctttcatt ttgccgtcac caccac 26 


«210» SEQ ID NO 28 

«211» LENGTH: 24 

«212» TYPE: DNA 

«213» ORGANISM: Artificial Sequence 

«220» FEATURE: 

«223» OTHER INFORMATION: Synthetic oligonucleotide. 


«400» SEQUENCE: 28 

agctctccct agcattattc actg 24 
«210» SEQ ID NO 29 

«211» LENGTH: 20 


«212» TYPE: DNA 
«213» ORGANISM: Artificial Sequence 
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<220> FEATURE: 
<223> OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 29 


caccacattt tcatcgaggc 20 


<210> SEQ ID NO 30 

<211> LENGTH: 22 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 30 


taccctcgat cgtactccge gt 22 


<210> SEQ ID NO 31 

<211> LENGTH: 23 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 31 


tgtaggcact gattcaggtt ttg 23 


<210> SEQ ID NO 32 

<211> LENGTH: 23 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 32 


cggegtggtc tgtatttaat tta 23 


«210» SEQ ID NO 33 

«211» LENGTH: 27 

«212» TYPE: DNA 

«213» ORGANISM: Artificial Sequence 

«220» FEATURE: 

«223» OTHER INFORMATION: Synthetic oligonucleotide. 


«400» SEQUENCE: 33 


ctgcatacaa ccgctaccgt attggaa 27 


«210» SEQ ID NO 34 

«211» LENGTH: 24 

«212» TYPE: DNA 

«213» ORGANISM: Artificial Sequence 

«220» FEATURE: 

«223» OTHER INFORMATION: Synthetic oligonucleotide. 


«400» SEQUENCE: 34 


gggttgggac tatcctaagt gtga 24 


«210» SEQ ID NO 35 

«211» LENGTH: 21 

«212» TYPE: DNA 

«213» ORGANISM: Artificial Sequence 

«220» FEATURE: 

«223» OTHER INFORMATION: Synthetic oligonucleotide. 
«220» FEATURE: 

«221» NAME/KEY: misc feature 

«222» LOCATION: (12)..(12) 

«223» OTHER INFORMATION: "n^ equals inosine. 
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-continued 


<400> SEQUENCE: 35 


taacacacaa cnccatcatc a 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


SEQ ID NO 36 

LENGTH: 19 

TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 36 


agatttggac ctgegageg 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


SEQ ID NO 37 

LENGTH: 20 

TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 37 


gageggctgt ctccacaagt 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


SEQ ID NO 38 

LENGTH: 23 

TYPE: DNA 

ORGANISM: Artificial Sequence 

FEATURE: 

OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 38 


ttctgacctg aaggctctgc geg 


The invention claimed is: 
1. A method of detecting a severe acute respiratory syn- 
drome-associated coronavirus (SARS-CoV) in a sample 
comprising: 
contacting the sample with a pair of nucleic acid primers 
that hybridize to a SARS-CoV nucleic acid, wherein at 
least one primeris 5'-end labeled with a reporter dye, and 
wherein at least one of the primers comprises the 
sequence as set forth in any one of SEQ ID NOs: 13-15; 

amplifying the SARS-CoV nucleic acid or a fragment 
thereof from the sample utilizing the pair of nucleic acid 
primers; 

electrophoresing the amplified products; and 

detecting the 5'-end labeled reporter dye, thereby detecting 

a SARS-CoV. 
2. The method of claim 1, wherein the amplification uti- 
lizes reverse transcriptase-polymerase chain reaction. 
3. A method of detecting a severe acute respiratory syn- 
drome-associated coronavirus (SARS-CoV) in a sample, 
comprising: 
contacting the sample with a pair of nucleic acid primers 
that hybridize to a SARS-CoV nucleic acid, wherein at 
least one of the nucleic acid primers comprises the 
sequence as set forth in any one of SEQ ID NOs: 13-15; 

amplifying the SARS-CoV nucleic acid or a fragment 
thereof from the sample utilizing the pair of nucleic acid 
primers; 
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adding to the amplified SARS-CoV nucleic acid or the 
fragment thereof a SARS-CoV probe that hybridizes to 
the SARS-CoV nucleic acid, wherein the SARS-CoV 
probe is labeled with a 5'-reporter dye and a 3'-quencher 
dye; 

performing one or more additional rounds of amplification 
with Taq DNA polymerase; and 

detecting fluorescence of the 5'-reporter dye, thereby 
detecting a SARS-CoV. 


4. Akit for detecting a severe acute respiratory syndrome- 
associated coronavirus (SARS-CoV) ina sample, comprising 
a pair of nucleic acid primers that hybridize under stringent 
conditions toa SARS-CoV nucleic acid, wherein at least one 
of the primers comprises the sequence as set forth in any one 
of SEQ ID NOs: 13-15. 

5. The kit of claim 4, wherein one primer is 5'-end labeled 
with a reporter dye. 

6. The kit of claim 4, further comprising a SARS-CoV 
probe that hybridizes to the SARS-CoV nucleic acid ampli- 
fied by the pair of primers, wherein the SARS-CoV probe is 
labeled with a 5'-reporter dye and a 3'-quencher dye. 

7. The kit of claim 4, further comprising an isolated SARS- 
CoV organism. 


